Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 10000
Missing cells (%): 20.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 488.3 KiB
Average record size in memory: 50.0 B
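
The derived figures above follow from the raw counts. The short Python sketch below recomputes them using only numbers copied from this section; the variable names are illustrative and are not part of the report.

```python
# Figures copied from the Dataset statistics above.
n_variables = 5
n_observations = 10_000
missing_cells = 10_000
total_size_kib = 488.3

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 20.0% figure is accounted for entirely by one of the five columns being empty, which matches the alert raised for `Unnamed: 4`.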

Variable types

Numeric: 1
Text: 2
Categorical: 1
Unsupported: 1

Dataset

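Description: Data on the status of Zero Pay merchants in Busan Metropolitan City, providing the following fields: business name, district (si/gun/gu), merchant base address, latitude, longitude, and data reference date.
URL: https://www.data.go.kr/data/15078025/fileData.do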

Alerts

데이터기준일자 has constant value "2023-06-30" (Constant)
Unnamed: 4 has 10000 (100.0%) missing values (Missing)
연번 has unique values (Unique)
Unnamed: 4 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
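
As a rough illustration of how alerts like these can be derived, the sketch below applies three simplified rules (all values missing, a single distinct value, all values distinct) to a few records taken from the Sample section. The helper `column_alerts` and its rules are assumptions made for this example. They are not ydata-profiling's actual implementation.

```python
def column_alerts(rows):
    """Return {column: [alert, ...]} for a list of dict records.

    Hypothetical helper with simplified rules, for illustration only.
    """
    alerts = {}
    n = len(rows)
    for col in rows[0]:
        present = [r[col] for r in rows if r[col] is not None]
        found = []
        if not present:
            found.append("Missing")    # every value is missing
        elif len(set(present)) == 1 and n > 1:
            found.append("Constant")   # one distinct value across all rows
        elif len(set(present)) == n:
            found.append("Unique")     # no value repeats
        if found:
            alerts[col] = found
    return alerts


# Three records from the Sample section, restricted to the flagged columns.
rows = [
    {"연번": 39110, "데이터기준일자": "2023-06-30", "Unnamed: 4": None},
    {"연번": 55144, "데이터기준일자": "2023-06-30", "Unnamed: 4": None},
    {"연번": 34583, "데이터기준일자": "2023-06-30", "Unnamed: 4": None},
]

print(column_alerts(rows))
```

Columns flagged as Missing or Constant carry no information for analysis, so dropping them before modelling is a common cleaning step.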

Reproduction

Analysis started: 2023-12-12 13:56:35.223259
Analysis finished: 2023-12-12 13:56:36.994829
Duration: 1.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50682.45
Minimum: 5
Maximum: 99991
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 5
5-th percentile: 4814.8
Q1: 25843.5
Median: 50867
Q3: 76094
95-th percentile: 95373.1
Maximum: 99991
Range: 99986
Interquartile range (IQR): 50250.5
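
The last two rows are derived from the others. A minimal Python check, using only the values listed above (variable names are illustrative):

```python
# Values copied from the Quantile statistics above.
minimum, q1, q3, maximum = 5, 25843.5, 76094, 99991

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(value_range, iqr)
```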

Descriptive statistics

Standard deviation: 29034.303
Coefficient of variation (CV): 0.572867
Kurtosis: -1.2037493
Mean: 50682.45
Median Absolute Deviation (MAD): 25103
Skewness: -0.029450592
Sum: 5.068245 × 10^8
Variance: 8.4299076 × 10^8
Monotonicity: Not monotonic
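
Several of these statistics are tied together by simple identities: CV is the standard deviation divided by the mean, variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks the reported values against each other. Small differences are expected because the report rounds its inputs.

```python
import math

# Values copied from the report.
mean = 50682.45
std = 29034.303
n_observations = 10_000

cv = std / mean                # coefficient of variation
variance = std ** 2            # variance as squared standard deviation
total = mean * n_observations  # sum of all values

# Compare with the reported figures, allowing for rounding.
print(math.isclose(cv, 0.572867, rel_tol=1e-6))
print(math.isclose(variance, 8.4299076e8, rel_tol=1e-6))
print(math.isclose(total, 5.068245e8, rel_tol=1e-9))
```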
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
39110 | 1 | < 0.1%
86892 | 1 | < 0.1%
27547 | 1 | < 0.1%
15300 | 1 | < 0.1%
8455 | 1 | < 0.1%
72434 | 1 | < 0.1%
47710 | 1 | < 0.1%
74138 | 1 | < 0.1%
50041 | 1 | < 0.1%
87895 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Smallest values

Value | Count | Frequency (%)
5 | 1 | < 0.1%
9 | 1 | < 0.1%
11 | 1 | < 0.1%
16 | 1 | < 0.1%
22 | 1 | < 0.1%
48 | 1 | < 0.1%
57 | 1 | < 0.1%
61 | 1 | < 0.1%
64 | 1 | < 0.1%
77 | 1 | < 0.1%

Largest values

Value | Count | Frequency (%)
99991 | 1 | < 0.1%
99979 | 1 | < 0.1%
99970 | 1 | < 0.1%
99960 | 1 | < 0.1%
99956 | 1 | < 0.1%
99941 | 1 | < 0.1%
99930 | 1 | < 0.1%
99921 | 1 | < 0.1%
99914 | 1 | < 0.1%
99907 | 1 | < 0.1%

가맹점명
Text

Distinct: 9738
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 34
Median length: 31
Mean length: 7.1555
Min length: 1

Characters and Unicode

Total characters: 71555
Distinct characters: 1107
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
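
To make the idea concrete, the sketch below counts characters by Unicode general category with Python's standard `unicodedata` module. The input is one merchant name from the Sample section, and `category_counts` is a hypothetical helper written for this example. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Lu` is Uppercase Letter, `Ll` is Lowercase Letter, `Zs` is Space Separator, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# A merchant name that mixes Hangul, Latin letters, spaces, and parentheses.
counts = category_counts("비바라비다(Viva La Vida)")

for code, count in counts.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter because Hangul has no upper or lower case, which is why that category dominates the tables in this section.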

Unique

Unique: 9532
Unique (%): 95.3%

Sample

1st row: 전주소반
2nd row: 고성비엠
3rd row: 해로
4th row: 조선비생고기
5th row: 송현아뷰티

Value | Count | Frequency (%)
아모레 | 153 | 1.2%
한국야쿠르트 | 113 | 0.9%
㈜비지에프네트웍스 | 91 | 0.7%
gs | 79 | 0.6%
postbox | 79 | 0.6%
세븐일레븐 | 74 | 0.6%
주식회사 | 73 | 0.6%
롯데택배 | 65 | 0.5%
gs25 | 60 | 0.5%
씨유 | 59 | 0.4%
Other values (10633) | 12348 | 93.6%

Most occurring characters

ValueCountFrequency (%)
3226
 
4.5%
2031
 
2.8%
1502
 
2.1%
1387
 
1.9%
929
 
1.3%
928
 
1.3%
827
 
1.2%
786
 
1.1%
) 761
 
1.1%
( 761
 
1.1%
Other values (1097) 58417
81.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 60345 | 84.3%
Space Separator | 3226 | 4.5%
Uppercase Letter | 2206 | 3.1%
Lowercase Letter | 2119 | 3.0%
Decimal Number | 1394 | 1.9%
Open Punctuation | 798 | 1.1%
Close Punctuation | 797 | 1.1%
Other Punctuation | 364 | 0.5%
Connector Punctuation | 160 | 0.2%
Other Symbol | 111 | 0.2%
Other values (3) | 35 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2031
 
3.4%
1502
 
2.5%
1387
 
2.3%
929
 
1.5%
928
 
1.5%
827
 
1.4%
786
 
1.3%
736
 
1.2%
730
 
1.2%
694
 
1.2%
Other values (1000) 49795
82.5%
Uppercase Letter
ValueCountFrequency (%)
S 338
15.3%
G 264
 
12.0%
A 112
 
5.1%
E 112
 
5.1%
C 110
 
5.0%
B 108
 
4.9%
O 108
 
4.9%
V 103
 
4.7%
N 91
 
4.1%
M 90
 
4.1%
Other values (16) 770
34.9%
Lowercase Letter
ValueCountFrequency (%)
o 311
14.7%
e 188
 
8.9%
a 170
 
8.0%
s 162
 
7.6%
t 154
 
7.3%
p 115
 
5.4%
i 112
 
5.3%
b 109
 
5.1%
r 103
 
4.9%
n 93
 
4.4%
Other values (16) 602
28.4%
Decimal Number
ValueCountFrequency (%)
2 374
26.8%
5 212
15.2%
1 193
13.8%
4 136
 
9.8%
3 104
 
7.5%
0 97
 
7.0%
7 75
 
5.4%
9 70
 
5.0%
8 67
 
4.8%
6 59
 
4.2%
Other values (6) 7
 
0.5%
Other Punctuation
ValueCountFrequency (%)
* 152
41.8%
& 81
22.3%
. 51
 
14.0%
, 42
 
11.5%
10
 
2.7%
' 8
 
2.2%
# 8
 
2.2%
/ 5
 
1.4%
: 2
 
0.5%
; 1
 
0.3%
Other values (4) 4
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 761
95.5%
35
 
4.4%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 761
95.4%
36
 
4.5%
[ 1
 
0.1%
Other Symbol
ValueCountFrequency (%)
110
99.1%
1
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
- 26
96.3%
1
 
3.7%
Math Symbol
ValueCountFrequency (%)
+ 5
83.3%
~ 1
 
16.7%
Space Separator
ValueCountFrequency (%)
3226
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 160
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 60443 | 84.5%
Common | 6773 | 9.5%
Latin | 4327 | 6.0%
Han | 12 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2031
 
3.4%
1502
 
2.5%
1387
 
2.3%
929
 
1.5%
928
 
1.5%
827
 
1.4%
786
 
1.3%
736
 
1.2%
730
 
1.2%
694
 
1.1%
Other values (989) 49893
82.5%
Latin
ValueCountFrequency (%)
S 338
 
7.8%
o 311
 
7.2%
G 264
 
6.1%
e 188
 
4.3%
a 170
 
3.9%
s 162
 
3.7%
t 154
 
3.6%
p 115
 
2.7%
i 112
 
2.6%
A 112
 
2.6%
Other values (43) 2401
55.5%
Common
ValueCountFrequency (%)
3226
47.6%
) 761
 
11.2%
( 761
 
11.2%
2 374
 
5.5%
5 212
 
3.1%
1 193
 
2.8%
_ 160
 
2.4%
* 152
 
2.2%
4 136
 
2.0%
3 104
 
1.5%
Other values (33) 694
 
10.2%
Han
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 60330 | 84.3%
ASCII | 11006 | 15.4%
None | 201 | 0.3%
CJK | 11 | < 0.1%
Compat Jamo | 3 | < 0.1%
Number Forms | 2 | < 0.1%
Letterlike Symbols | 1 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3226
29.3%
) 761
 
6.9%
( 761
 
6.9%
2 374
 
3.4%
S 338
 
3.1%
o 311
 
2.8%
G 264
 
2.4%
5 212
 
1.9%
1 193
 
1.8%
e 188
 
1.7%
Other values (72) 4378
39.8%
Hangul
ValueCountFrequency (%)
2031
 
3.4%
1502
 
2.5%
1387
 
2.3%
929
 
1.5%
928
 
1.5%
827
 
1.4%
786
 
1.3%
736
 
1.2%
730
 
1.2%
694
 
1.2%
Other values (986) 49780
82.5%
None
ValueCountFrequency (%)
110
54.7%
36
 
17.9%
35
 
17.4%
10
 
5.0%
2
 
1.0%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
Other values (3) 3
 
1.5%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

가맹점기본주소
Text

Distinct: 9796
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 81
Median length: 68
Mean length: 32.4049
Min length: 15

Characters and Unicode

Total characters: 324049
Distinct characters: 1006
Distinct categories: 14
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9702
Unique (%): 97.0%

Sample

1st row: 부산광역시 연제구 월드컵대로 160 (연산동)1층 일부호
2nd row: 부산광역시 수영구 망미번영로 23 (광안동)고성비엠
3rd row: 부산광역시 영도구 남항새싹4길 1 (영선동4가)해로 지상1층
4th row: 부산광역시 사상구 사상로277번길 19 (덕포동)조선비생고기
5th row: 부산영도구절영로7번길36송현아뷰티

Value | Count | Frequency (%)
부산광역시 | 10014 | 18.8%
부산진구 | 1337 | 2.5%
해운대구 | 1005 | 1.9%
동래구 | 813 | 1.5%
사하구 | 698 | 1.3%
사상구 | 688 | 1.3%
금정구 | 665 | 1.2%
수영구 | 630 | 1.2%
남구 | 606 | 1.1%
연제구 | 590 | 1.1%
Other values (16501) | 36292 | 68.0%

Most occurring characters

ValueCountFrequency (%)
43658
 
13.5%
1 13441
 
4.1%
12879
 
4.0%
12872
 
4.0%
11071
 
3.4%
11069
 
3.4%
10774
 
3.3%
10325
 
3.2%
10173
 
3.1%
9615
 
3.0%
Other values (996) 178172
55.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 205319 | 63.4%
Decimal Number | 51920 | 16.0%
Space Separator | 43659 | 13.5%
Close Punctuation | 7366 | 2.3%
Open Punctuation | 7364 | 2.3%
Other Punctuation | 3536 | 1.1%
Dash Punctuation | 1838 | 0.6%
Uppercase Letter | 1649 | 0.5%
Lowercase Letter | 1250 | 0.4%
Other Symbol | 95 | < 0.1%
Other values (4) | 53 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12879
 
6.3%
12872
 
6.3%
11071
 
5.4%
11069
 
5.4%
10774
 
5.2%
10325
 
5.0%
10173
 
5.0%
9615
 
4.7%
4672
 
2.3%
4354
 
2.1%
Other values (893) 107515
52.4%
Uppercase Letter
ValueCountFrequency (%)
S 234
14.2%
B 162
 
9.8%
G 160
 
9.7%
A 147
 
8.9%
V 100
 
6.1%
C 83
 
5.0%
E 64
 
3.9%
K 62
 
3.8%
I 61
 
3.7%
O 58
 
3.5%
Other values (16) 518
31.4%
Lowercase Letter
ValueCountFrequency (%)
o 225
18.0%
s 115
9.2%
b 108
 
8.6%
t 107
 
8.6%
p 97
 
7.8%
x 79
 
6.3%
e 76
 
6.1%
a 72
 
5.8%
i 51
 
4.1%
n 44
 
3.5%
Other values (14) 276
22.1%
Decimal Number
ValueCountFrequency (%)
1 13441
25.9%
2 7739
14.9%
3 5609
10.8%
0 5090
 
9.8%
4 4429
 
8.5%
5 3754
 
7.2%
6 3335
 
6.4%
7 3229
 
6.2%
9 2651
 
5.1%
8 2623
 
5.1%
Other values (9) 20
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
, 3349
94.7%
. 80
 
2.3%
& 40
 
1.1%
/ 21
 
0.6%
9
 
0.3%
8
 
0.2%
· 6
 
0.2%
' 6
 
0.2%
# 4
 
0.1%
* 4
 
0.1%
Other values (7) 9
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 7334
99.6%
28
 
0.4%
] 4
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 7331
99.6%
29
 
0.4%
[ 4
 
0.1%
Space Separator
ValueCountFrequency (%)
43658
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1837
99.9%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
94
98.9%
1
 
1.1%
Math Symbol
ValueCountFrequency (%)
~ 39
97.5%
+ 1
 
2.5%
Control
ValueCountFrequency (%)
9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 205250 | 63.3%
Common | 115736 | 35.7%
Latin | 2900 | 0.9%
Han | 163 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12879
 
6.3%
12872
 
6.3%
11071
 
5.4%
11069
 
5.4%
10774
 
5.2%
10325
 
5.0%
10173
 
5.0%
9615
 
4.7%
4672
 
2.3%
4354
 
2.1%
Other values (883) 107446
52.3%
Common
ValueCountFrequency (%)
43658
37.7%
1 13441
 
11.6%
2 7739
 
6.7%
) 7334
 
6.3%
( 7331
 
6.3%
3 5609
 
4.8%
0 5090
 
4.4%
4 4429
 
3.8%
5 3754
 
3.2%
, 3349
 
2.9%
Other values (41) 14002
 
12.1%
Latin
ValueCountFrequency (%)
S 234
 
8.1%
o 225
 
7.8%
B 162
 
5.6%
G 160
 
5.5%
A 147
 
5.1%
s 115
 
4.0%
b 108
 
3.7%
t 107
 
3.7%
V 100
 
3.4%
p 97
 
3.3%
Other values (41) 1445
49.8%
Han
ValueCountFrequency (%)
153
93.9%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 205155 | 63.3%
ASCII | 118529 | 36.6%
None | 199 | 0.1%
CJK | 162 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%
Compat Jamo | 1 | < 0.1%
Letterlike Symbols | 1 | < 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43658
36.8%
1 13441
 
11.3%
2 7739
 
6.5%
) 7334
 
6.2%
( 7331
 
6.2%
3 5609
 
4.7%
0 5090
 
4.3%
4 4429
 
3.7%
5 3754
 
3.2%
, 3349
 
2.8%
Other values (72) 16795
 
14.2%
Hangul
ValueCountFrequency (%)
12879
 
6.3%
12872
 
6.3%
11071
 
5.4%
11069
 
5.4%
10774
 
5.3%
10325
 
5.0%
10173
 
5.0%
9615
 
4.7%
4672
 
2.3%
4354
 
2.1%
Other values (881) 107351
52.3%
CJK
ValueCountFrequency (%)
153
94.4%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
None
ValueCountFrequency (%)
94
47.2%
29
 
14.6%
28
 
14.1%
9
 
4.5%
8
 
4.0%
· 6
 
3.0%
5
 
2.5%
5
 
2.5%
3
 
1.5%
2
 
1.0%
Other values (9) 10
 
5.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
2023-06-30 | 10000

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-06-30
2nd row: 2023-06-30
3rd row: 2023-06-30
4th row: 2023-06-30
5th row: 2023-06-30

Common Values

Value | Count | Frequency (%)
2023-06-30 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-06-30 | 10000 | 100.0%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 가맹점명 | 가맹점기본주소 | 데이터기준일자 | Unnamed: 4
39109 | 39110 | 전주소반 | 부산광역시 연제구 월드컵대로 160 (연산동)1층 일부호 | 2023-06-30 | <NA>
55143 | 55144 | 고성비엠 | 부산광역시 수영구 망미번영로 23 (광안동)고성비엠 | 2023-06-30 | <NA>
34582 | 34583 | 해로 | 부산광역시 영도구 남항새싹4길 1 (영선동4가)해로 지상1층 | 2023-06-30 | <NA>
29621 | 29622 | 조선비생고기 | 부산광역시 사상구 사상로277번길 19 (덕포동)조선비생고기 | 2023-06-30 | <NA>
22586 | 22587 | 송현아뷰티 | 부산영도구절영로7번길36송현아뷰티 | 2023-06-30 | <NA>
90944 | 90945 | 버거린 | 부산광역시 영도구 태종로 501층 버거린버거린 | 2023-06-30 | <NA>
4 | 5 | 남도상회 | 부산광역시 중구 자갈치해안로 52자갈치시장 1층 | 2023-06-30 | <NA>
99221 | 99222 | 비바라비다(Viva La Vida) | 부산광역시 강서구 명지국제12로11번길 16-1(명지동),1층(명지동 아뜨리움빌) | 2023-06-30 | <NA>
377 | 378 | 온천약국 | 부산광역시 동래구 온천장로 81-1 (온천동)온천약국 | 2023-06-30 | <NA>
32666 | 32667 | 세진냉열시스템 | 부산광역시 해운대구 재반로242번길 13-13 (반여동)1층 | 2023-06-30 | <NA>

(index) | 연번 | 가맹점명 | 가맹점기본주소 | 데이터기준일자 | Unnamed: 4
31423 | 31424 | PT NEWYORK | 부산광역시 수영구 수영로705번길 29PT NEWYORK | 2023-06-30 | <NA>
72296 | 72297 | 시골장터 | 부산광역시 북구 덕천로 316-1 (만덕동)시골장터 | 2023-06-30 | <NA>
42635 | 42636 | 한국야쿠르트 초량점 18 | 부산광역시 중구 초량중로 7-11층 | 2023-06-30 | <NA>
84321 | 84322 | 법무사사무소 로고스 | 부산광역시 연제구 거제대로 270(거제동)7층 703호 | 2023-06-30 | <NA>
8612 | 8613 | 이충구 | 부산광역시 강서구 낙동남로511번길 38이충구 | 2023-06-30 | <NA>
43952 | 43953 | ㈜비지에프네트웍스 덕포고려점 | 부산광역시 사상구 사상로276 ㈜비지에프네트웍스 덕포고려점 | 2023-06-30 | <NA>
31680 | 31681 | 아모레 카운셀러_노양* | 부산광역시 해운대구 해운대로191번길 10 C동 2층 (재송동,성경식품)빌딩 및 상가 內 | 2023-06-30 | <NA>
83090 | 83091 | 포엘커피 | 부산광역시 수영구 광안해변로 125(남천동)1층 | 2023-06-30 | <NA>
86435 | 86436 | 하삼동커피 다대낫개점 | 부산광역시 사하구 다송로72번길 63(다대동)103호 104호 (예원프라자) | 2023-06-30 | <NA>
83211 | 83212 | 작심스터디카페(부산광역시 금련산역점) | 부산광역시 수영구 수영로464번길 6(남천동)목원빌딩 4층 401호 | 2023-06-30 | <NA>