Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells10000
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Numeric1
Text2
Categorical1
Unsupported1

Dataset

Description부산광역시_제로페이가맹점현황_20230630
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15078025

Alerts

데이터기준일자 has constant value ""Constant
Unnamed: 4 has 10000 (100.0%) missing valuesMissing
연번 has unique valuesUnique
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-21 09:32:58.590923
Analysis finished2024-04-21 09:33:01.210420
Duration2.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49680.67
Minimum10
Maximum99999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T18:33:01.348941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile5340.4
Q124800.75
median49351.5
Q374609.5
95-th percentile95133.1
Maximum99999
Range99989
Interquartile range (IQR)49808.75

Descriptive statistics

Standard deviation28847.164
Coefficient of variation (CV)0.58065167
Kurtosis-1.1965128
Mean49680.67
Median Absolute Deviation (MAD)24928
Skewness0.025638341
Sum4.968067 × 108
Variance8.3215886 × 108
MonotonicityNot monotonic
2024-04-21T18:33:01.599971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
80196 1
 
< 0.1%
30628 1
 
< 0.1%
41320 1
 
< 0.1%
84913 1
 
< 0.1%
66203 1
 
< 0.1%
72734 1
 
< 0.1%
52565 1
 
< 0.1%
21625 1
 
< 0.1%
55581 1
 
< 0.1%
81495 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
10 1
< 0.1%
12 1
< 0.1%
49 1
< 0.1%
50 1
< 0.1%
51 1
< 0.1%
58 1
< 0.1%
67 1
< 0.1%
84 1
< 0.1%
94 1
< 0.1%
119 1
< 0.1%
ValueCountFrequency (%)
99999 1
< 0.1%
99991 1
< 0.1%
99988 1
< 0.1%
99986 1
< 0.1%
99980 1
< 0.1%
99973 1
< 0.1%
99971 1
< 0.1%
99968 1
< 0.1%
99958 1
< 0.1%
99952 1
< 0.1%
Distinct9743
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T18:33:02.775953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length7.1202
Min length1

Characters and Unicode

Total characters71202
Distinct characters1101
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9535 ?
Unique (%)95.3%

Sample

1st row세븐일레븐 부산부전에코점
2nd row보임미디어
3rd row아모레 카운셀러_성종*
4th row다통돼지볶음
5th row불맛석쇠쭈꾸미&빈대떡
ValueCountFrequency (%)
아모레 196
 
1.5%
한국야쿠르트 116
 
0.9%
㈜비지에프네트웍스 98
 
0.7%
gs 79
 
0.6%
postbox 79
 
0.6%
세븐일레븐 77
 
0.6%
주식회사 67
 
0.5%
롯데택배 55
 
0.4%
씨유 49
 
0.4%
이마트24 49
 
0.4%
Other values (10598) 12293
93.4%
2024-04-21T18:33:04.236504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3196
 
4.5%
1957
 
2.7%
1407
 
2.0%
1334
 
1.9%
897
 
1.3%
875
 
1.2%
850
 
1.2%
804
 
1.1%
) 778
 
1.1%
( 774
 
1.1%
Other values (1091) 58330
81.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60036
84.3%
Space Separator 3196
 
4.5%
Uppercase Letter 2243
 
3.2%
Lowercase Letter 1863
 
2.6%
Decimal Number 1439
 
2.0%
Close Punctuation 815
 
1.1%
Open Punctuation 811
 
1.1%
Other Punctuation 456
 
0.6%
Connector Punctuation 202
 
0.3%
Other Symbol 113
 
0.2%
Other values (2) 28
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1957
 
3.3%
1407
 
2.3%
1334
 
2.2%
897
 
1.5%
875
 
1.5%
850
 
1.4%
804
 
1.3%
741
 
1.2%
728
 
1.2%
669
 
1.1%
Other values (1008) 49774
82.9%
Uppercase Letter
ValueCountFrequency (%)
S 321
14.3%
G 260
 
11.6%
A 128
 
5.7%
C 123
 
5.5%
E 116
 
5.2%
O 111
 
4.9%
V 107
 
4.8%
T 99
 
4.4%
B 99
 
4.4%
P 92
 
4.1%
Other values (16) 787
35.1%
Lowercase Letter
ValueCountFrequency (%)
o 266
14.3%
e 165
 
8.9%
s 156
 
8.4%
t 140
 
7.5%
a 138
 
7.4%
b 110
 
5.9%
p 106
 
5.7%
i 101
 
5.4%
n 100
 
5.4%
r 82
 
4.4%
Other values (15) 499
26.8%
Other Punctuation
ValueCountFrequency (%)
* 194
42.5%
& 92
20.2%
. 81
17.8%
, 42
 
9.2%
13
 
2.9%
' 10
 
2.2%
# 9
 
2.0%
4
 
0.9%
/ 4
 
0.9%
; 2
 
0.4%
Other values (3) 5
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 348
24.2%
1 222
15.4%
5 206
14.3%
4 145
10.1%
0 124
 
8.6%
3 103
 
7.2%
8 81
 
5.6%
7 76
 
5.3%
9 74
 
5.1%
6 60
 
4.2%
Close Punctuation
ValueCountFrequency (%)
) 778
95.5%
37
 
4.5%
Open Punctuation
ValueCountFrequency (%)
( 774
95.4%
37
 
4.6%
Space Separator
ValueCountFrequency (%)
3196
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 202
100.0%
Other Symbol
ValueCountFrequency (%)
113
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60125
84.4%
Common 6947
 
9.8%
Latin 4106
 
5.8%
Han 24
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1957
 
3.3%
1407
 
2.3%
1334
 
2.2%
897
 
1.5%
875
 
1.5%
850
 
1.4%
804
 
1.3%
741
 
1.2%
728
 
1.2%
669
 
1.1%
Other values (987) 49863
82.9%
Latin
ValueCountFrequency (%)
S 321
 
7.8%
o 266
 
6.5%
G 260
 
6.3%
e 165
 
4.0%
s 156
 
3.8%
t 140
 
3.4%
a 138
 
3.4%
A 128
 
3.1%
C 123
 
3.0%
E 116
 
2.8%
Other values (41) 2293
55.8%
Common
ValueCountFrequency (%)
3196
46.0%
) 778
 
11.2%
( 774
 
11.1%
2 348
 
5.0%
1 222
 
3.2%
5 206
 
3.0%
_ 202
 
2.9%
* 194
 
2.8%
4 145
 
2.1%
0 124
 
1.8%
Other values (21) 758
 
10.9%
Han
ValueCountFrequency (%)
3
 
12.5%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (12) 12
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60011
84.3%
ASCII 10961
 
15.4%
None 205
 
0.3%
CJK 24
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3196
29.2%
) 778
 
7.1%
( 774
 
7.1%
2 348
 
3.2%
S 321
 
2.9%
o 266
 
2.4%
G 260
 
2.4%
1 222
 
2.0%
5 206
 
1.9%
_ 202
 
1.8%
Other values (67) 4388
40.0%
Hangul
ValueCountFrequency (%)
1957
 
3.3%
1407
 
2.3%
1334
 
2.2%
897
 
1.5%
875
 
1.5%
850
 
1.4%
804
 
1.3%
741
 
1.2%
728
 
1.2%
669
 
1.1%
Other values (985) 49749
82.9%
None
ValueCountFrequency (%)
113
55.1%
37
 
18.0%
37
 
18.0%
13
 
6.3%
4
 
2.0%
· 1
 
0.5%
CJK
ValueCountFrequency (%)
3
 
12.5%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (12) 12
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct9757
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T18:33:05.556890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length68
Mean length32.4312
Min length15

Characters and Unicode

Total characters324312
Distinct characters996
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9656 ?
Unique (%)96.6%

Sample

1st row부산광역시 부산진구 중앙대로 653 (부전동)세븐일레븐 부산부전에코점
2nd row부산광역시 부산진구 신천대로220번길 93-9보임미디어
3rd row부산광역시 동래구 아시아드대로 225 3층 (온천동,미남메디칼센터)빌딩 및 상가 內
4th row부산광역시 중구 중구로23번길 91층
5th row부산광역시 영도구 남항로49번길 42(영선동1가)
ValueCountFrequency (%)
부산광역시 9999
 
18.7%
부산진구 1273
 
2.4%
해운대구 1012
 
1.9%
동래구 800
 
1.5%
사하구 755
 
1.4%
금정구 692
 
1.3%
사상구 668
 
1.3%
남구 637
 
1.2%
수영구 608
 
1.1%
강서구 575
 
1.1%
Other values (16439) 36398
68.1%
2024-04-21T18:33:07.211131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43681
 
13.5%
1 13019
 
4.0%
12791
 
3.9%
12771
 
3.9%
11197
 
3.5%
11095
 
3.4%
10801
 
3.3%
10335
 
3.2%
10127
 
3.1%
9567
 
2.9%
Other values (986) 178928
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 205134
63.3%
Decimal Number 51839
 
16.0%
Space Separator 43681
 
13.5%
Open Punctuation 7481
 
2.3%
Close Punctuation 7477
 
2.3%
Other Punctuation 3537
 
1.1%
Dash Punctuation 1903
 
0.6%
Uppercase Letter 1725
 
0.5%
Lowercase Letter 1384
 
0.4%
Other Symbol 103
 
< 0.1%
Other values (4) 48
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12791
 
6.2%
12771
 
6.2%
11197
 
5.5%
11095
 
5.4%
10801
 
5.3%
10335
 
5.0%
10127
 
4.9%
9567
 
4.7%
4688
 
2.3%
4398
 
2.1%
Other values (890) 107364
52.3%
Uppercase Letter
ValueCountFrequency (%)
S 232
13.4%
B 173
 
10.0%
G 160
 
9.3%
A 139
 
8.1%
C 119
 
6.9%
V 101
 
5.9%
T 79
 
4.6%
E 72
 
4.2%
O 70
 
4.1%
P 67
 
3.9%
Other values (16) 513
29.7%
Lowercase Letter
ValueCountFrequency (%)
o 220
15.9%
s 124
9.0%
e 117
 
8.5%
t 115
 
8.3%
b 107
 
7.7%
p 97
 
7.0%
a 83
 
6.0%
x 81
 
5.9%
i 69
 
5.0%
n 66
 
4.8%
Other values (15) 305
22.0%
Other Punctuation
ValueCountFrequency (%)
, 3299
93.3%
. 114
 
3.2%
& 48
 
1.4%
/ 24
 
0.7%
15
 
0.4%
8
 
0.2%
· 7
 
0.2%
# 5
 
0.1%
' 3
 
0.1%
3
 
0.1%
Other values (6) 11
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 13019
25.1%
2 7905
15.2%
3 5494
10.6%
0 5010
 
9.7%
4 4402
 
8.5%
5 3945
 
7.6%
6 3428
 
6.6%
7 3082
 
5.9%
8 2819
 
5.4%
9 2727
 
5.3%
Other values (4) 8
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 7445
99.5%
34
 
0.5%
[ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 7441
99.5%
34
 
0.5%
] 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 35
97.2%
+ 1
 
2.8%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
43681
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1903
100.0%
Other Symbol
ValueCountFrequency (%)
103
100.0%
Control
ValueCountFrequency (%)
5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 205028
63.2%
Common 115962
35.8%
Latin 3113
 
1.0%
Han 209
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12791
 
6.2%
12771
 
6.2%
11197
 
5.5%
11095
 
5.4%
10801
 
5.3%
10335
 
5.0%
10127
 
4.9%
9567
 
4.7%
4688
 
2.3%
4398
 
2.1%
Other values (879) 107258
52.3%
Latin
ValueCountFrequency (%)
S 232
 
7.5%
o 220
 
7.1%
B 173
 
5.6%
G 160
 
5.1%
A 139
 
4.5%
s 124
 
4.0%
C 119
 
3.8%
e 117
 
3.8%
t 115
 
3.7%
b 107
 
3.4%
Other values (43) 1607
51.6%
Common
ValueCountFrequency (%)
43681
37.7%
1 13019
 
11.2%
2 7905
 
6.8%
( 7445
 
6.4%
) 7441
 
6.4%
3 5494
 
4.7%
0 5010
 
4.3%
4 4402
 
3.8%
5 3945
 
3.4%
6 3428
 
3.0%
Other values (32) 14192
 
12.2%
Han
ValueCountFrequency (%)
196
93.8%
3
 
1.4%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
Other values (2) 2
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 204917
63.2%
ASCII 118962
36.7%
None 212
 
0.1%
CJK 209
 
0.1%
Compat Jamo 8
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43681
36.7%
1 13019
 
10.9%
2 7905
 
6.6%
( 7445
 
6.3%
) 7441
 
6.3%
3 5494
 
4.6%
0 5010
 
4.2%
4 4402
 
3.7%
5 3945
 
3.3%
6 3428
 
2.9%
Other values (73) 17192
 
14.5%
Hangul
ValueCountFrequency (%)
12791
 
6.2%
12771
 
6.2%
11197
 
5.5%
11095
 
5.4%
10801
 
5.3%
10335
 
5.0%
10127
 
4.9%
9567
 
4.7%
4688
 
2.3%
4398
 
2.1%
Other values (872) 107147
52.3%
CJK
ValueCountFrequency (%)
196
93.8%
3
 
1.4%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
Other values (2) 2
 
1.0%
None
ValueCountFrequency (%)
103
48.6%
34
 
16.0%
34
 
16.0%
15
 
7.1%
8
 
3.8%
· 7
 
3.3%
4
 
1.9%
3
 
1.4%
2
 
0.9%
1
 
0.5%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Compat Jamo
ValueCountFrequency (%)
3
37.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-06-30
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-30
2nd row2023-06-30
3rd row2023-06-30
4th row2023-06-30
5th row2023-06-30

Common Values

ValueCountFrequency (%)
2023-06-30 10000
100.0%

Length

2024-04-21T18:33:07.431617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T18:33:07.589637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-30 10000
100.0%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Interactions

2024-04-21T18:33:00.762365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-21T18:33:00.959826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T18:33:01.131326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번가맹점명가맹점기본주소데이터기준일자Unnamed: 4
8019580196세븐일레븐 부산부전에코점부산광역시 부산진구 중앙대로 653 (부전동)세븐일레븐 부산부전에코점2023-06-30<NA>
8916989170보임미디어부산광역시 부산진구 신천대로220번길 93-9보임미디어2023-06-30<NA>
3245232453아모레 카운셀러_성종*부산광역시 동래구 아시아드대로 225 3층 (온천동,미남메디칼센터)빌딩 및 상가 內2023-06-30<NA>
3670436705다통돼지볶음부산광역시 중구 중구로23번길 91층2023-06-30<NA>
1704117042불맛석쇠쭈꾸미&빈대떡부산광역시 영도구 남항로49번길 42(영선동1가)2023-06-30<NA>
2634226343동마이부산광역시 부산진구 서전로38번길 9F동 11호2023-06-30<NA>
1756817569불막열삼부산광역시 해운대구 해운대로383번길 70201동 201호 (우동)2023-06-30<NA>
9631696317동매카페마당부산광역시 사하구 다대로134번길 981층 동매카페마당2023-06-30<NA>
4194741948송정종합농기계부산광역시 기장군 철마면 두송길 29송정종합농기계2023-06-30<NA>
6006060061개인택시(김정선)부산광역시 영도구 와치로 231, 210동 1302호 (동삼동, 도개공동삼절영아파트)개인택시부산37바92332023-06-30<NA>
연번가맹점명가맹점기본주소데이터기준일자Unnamed: 4
7824678247짚신매운갈비찜부산광역시 북구 금곡대로285번길 39(화명동)102 호 짚신매운갈비찜2023-06-30<NA>
4245642457한국야쿠르트 명지점 9부산광역시 강서구 명지국제6로302번길 19-71층2023-06-30<NA>
2102321024알찬당구클럽부산광역시 동래구 충렬대로359번길 38 (안락동)6층2023-06-30<NA>
8856288563팟샵(potshop)부산광역시 수영구 수영로607번길 261층(광안동)2023-06-30<NA>
9400294003LG오휘화장품부산광역시 사상구 가야대로 370LG오휘화장품2023-06-30<NA>
4829148292신라해장국부산광역시 사하구 하신중앙로22번길 26 (장림동)신라해장국2023-06-30<NA>
8945489455I LOVE컴셈틀교습소부산광역시 동구 초량로 471층. 아이러브컴셈틀교습소I LOVE컴셈틀교습소2023-06-30<NA>
77457746임대부산광역시 해운대구 아랫반송로 35꾸바꾸바 (반송동)2023-06-30<NA>
1729817299도경횟집부산광역시 강서구 신호산단1로 54도경횟집 (신호동)2023-06-30<NA>
9214692147고봉민김밥인부산광역시 부산진구 가야대로 613고봉민김밥인2023-06-30<NA>