Overview

Dataset statistics

Number of variables6
Number of observations1755
Missing cells317
Missing cells (%)3.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory84.1 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Text3
Categorical1
DateTime1

Dataset

Description인천광역시 부평구 부평e음 혜택플러스가맹점 현황입니다.(연번,상호명,할인율(%),주소,전화번호,기준일자)ex) 1,맛을찾는사람들,2,인천광역시 부평구 길주로595번길 7-10 (갈산동) 1층일부,032-515-9344,2021-06-04
Author인천광역시
URLhttps://www.incheon.go.kr/data/DATA010201/view?docId=15083929

Alerts

기준일자 has constant value ""Constant
전화번호 has 317 (18.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 05:40:27.796377
Analysis finished2024-01-28 05:40:28.663308
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1755
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean878
Minimum1
Maximum1755
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size15.6 KiB
2024-01-28T14:40:28.724600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile88.7
Q1439.5
median878
Q31316.5
95-th percentile1667.3
Maximum1755
Range1754
Interquartile range (IQR)877

Descriptive statistics

Standard deviation506.76918
Coefficient of variation (CV)0.57718585
Kurtosis-1.2
Mean878
Median Absolute Deviation (MAD)439
Skewness0
Sum1540890
Variance256815
MonotonicityStrictly increasing
2024-01-28T14:40:28.845413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1168 1
 
0.1%
1179 1
 
0.1%
1178 1
 
0.1%
1177 1
 
0.1%
1176 1
 
0.1%
1175 1
 
0.1%
1174 1
 
0.1%
1173 1
 
0.1%
1172 1
 
0.1%
Other values (1745) 1745
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1755 1
0.1%
1754 1
0.1%
1753 1
0.1%
1752 1
0.1%
1751 1
0.1%
1750 1
0.1%
1749 1
0.1%
1748 1
0.1%
1747 1
0.1%
1746 1
0.1%
Distinct1750
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size13.8 KiB
2024-01-28T14:40:29.047800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length19
Mean length7.9897436
Min length2

Characters and Unicode

Total characters14022
Distinct characters772
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1745 ?
Unique (%)99.4%

Sample

1st row아론모터스
2nd row진영자동차공업사
3rd row인천금영
4th row대성상사
5th row마성네일
ValueCountFrequency (%)
부평점 173
 
6.3%
삼산점 33
 
1.2%
부개점 23
 
0.8%
청천점 15
 
0.5%
갈산점 14
 
0.5%
부평구청점 12
 
0.4%
본점 12
 
0.4%
산곡점 12
 
0.4%
카페 11
 
0.4%
인천부평점 10
 
0.4%
Other values (2097) 2424
88.5%
2024-01-28T14:40:29.345227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
990
 
7.1%
619
 
4.4%
480
 
3.4%
378
 
2.7%
256
 
1.8%
252
 
1.8%
193
 
1.4%
180
 
1.3%
156
 
1.1%
) 153
 
1.1%
Other values (762) 10365
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12020
85.7%
Space Separator 990
 
7.1%
Uppercase Letter 255
 
1.8%
Lowercase Letter 199
 
1.4%
Close Punctuation 154
 
1.1%
Open Punctuation 153
 
1.1%
Decimal Number 151
 
1.1%
Other Punctuation 91
 
0.6%
Dash Punctuation 5
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
619
 
5.1%
480
 
4.0%
378
 
3.1%
256
 
2.1%
252
 
2.1%
193
 
1.6%
180
 
1.5%
156
 
1.3%
139
 
1.2%
135
 
1.1%
Other values (689) 9232
76.8%
Uppercase Letter
ValueCountFrequency (%)
A 22
 
8.6%
B 20
 
7.8%
C 17
 
6.7%
O 17
 
6.7%
H 16
 
6.3%
E 14
 
5.5%
S 12
 
4.7%
T 12
 
4.7%
L 12
 
4.7%
N 11
 
4.3%
Other values (15) 102
40.0%
Lowercase Letter
ValueCountFrequency (%)
e 31
15.6%
a 19
 
9.5%
o 17
 
8.5%
r 15
 
7.5%
i 15
 
7.5%
f 12
 
6.0%
t 11
 
5.5%
n 9
 
4.5%
l 8
 
4.0%
m 8
 
4.0%
Other values (14) 54
27.1%
Decimal Number
ValueCountFrequency (%)
1 34
22.5%
2 27
17.9%
5 16
10.6%
9 15
9.9%
6 14
9.3%
4 11
 
7.3%
3 10
 
6.6%
0 9
 
6.0%
8 8
 
5.3%
7 7
 
4.6%
Other Punctuation
ValueCountFrequency (%)
& 62
68.1%
. 12
 
13.2%
, 8
 
8.8%
# 6
 
6.6%
· 2
 
2.2%
; 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 153
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 152
99.3%
[ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
990
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12020
85.7%
Common 1545
 
11.0%
Latin 454
 
3.2%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
619
 
5.1%
480
 
4.0%
378
 
3.1%
256
 
2.1%
252
 
2.1%
193
 
1.6%
180
 
1.5%
156
 
1.3%
139
 
1.2%
135
 
1.1%
Other values (687) 9232
76.8%
Latin
ValueCountFrequency (%)
e 31
 
6.8%
A 22
 
4.8%
B 20
 
4.4%
a 19
 
4.2%
C 17
 
3.7%
O 17
 
3.7%
o 17
 
3.7%
H 16
 
3.5%
r 15
 
3.3%
i 15
 
3.3%
Other values (39) 265
58.4%
Common
ValueCountFrequency (%)
990
64.1%
) 153
 
9.9%
( 152
 
9.8%
& 62
 
4.0%
1 34
 
2.2%
2 27
 
1.7%
5 16
 
1.0%
9 15
 
1.0%
6 14
 
0.9%
. 12
 
0.8%
Other values (13) 70
 
4.5%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12017
85.7%
ASCII 1997
 
14.2%
None 5
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
990
49.6%
) 153
 
7.7%
( 152
 
7.6%
& 62
 
3.1%
1 34
 
1.7%
e 31
 
1.6%
2 27
 
1.4%
A 22
 
1.1%
B 20
 
1.0%
a 19
 
1.0%
Other values (61) 487
24.4%
Hangul
ValueCountFrequency (%)
619
 
5.2%
480
 
4.0%
378
 
3.1%
256
 
2.1%
252
 
2.1%
193
 
1.6%
180
 
1.5%
156
 
1.3%
139
 
1.2%
135
 
1.1%
Other values (686) 9229
76.8%
None
ValueCountFrequency (%)
3
60.0%
· 2
40.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct10
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size13.8 KiB
3% 할인
804 
2% 할인
558 
배달e음 5% 할인
232 
5% 할인
83 
배달e음 3% 할인
 
40
Other values (5)
 
38

Length

Max length10
Median length5
Mean length5.8660969
Min length5

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row3% 할인
2nd row3% 할인
3rd row3% 할인
4th row3% 할인
5th row3% 할인

Common Values

ValueCountFrequency (%)
3% 할인 804
45.8%
2% 할인 558
31.8%
배달e음 5% 할인 232
 
13.2%
5% 할인 83
 
4.7%
배달e음 3% 할인 40
 
2.3%
배달e음 2% 할인 31
 
1.8%
1% 할인 3
 
0.2%
7% 할인 2
 
0.1%
배달e음 1% 할인 1
 
0.1%
4% 할인 1
 
0.1%

Length

2024-01-28T14:40:29.488192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T14:40:29.587653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
할인 1755
46.0%
3 844
22.1%
2 589
 
15.4%
5 315
 
8.3%
배달e음 304
 
8.0%
1 4
 
0.1%
7 2
 
0.1%
4 1
 
< 0.1%

주소
Text

Distinct1656
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size13.8 KiB
2024-01-28T14:40:29.835284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length45
Mean length30.388604
Min length16

Characters and Unicode

Total characters53332
Distinct characters292
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1584 ?
Unique (%)90.3%

Sample

1st row인천광역시 부평구 가좌로96번길 56(십정동)
2nd row인천광역시 부평구 가좌로96번길 60, A동(십정동)
3rd row인천광역시 부평구 갈산로 2
4th row인천광역시 부평구 갈산로5번길 6
5th row인천광역시 부평구 갈월동로 12, 1층(갈산동)
ValueCountFrequency (%)
인천광역시 1755
 
18.9%
부평구 1755
 
18.9%
1층 312
 
3.4%
1층(부평동 191
 
2.1%
부평대로 110
 
1.2%
주부토로 96
 
1.0%
장제로 64
 
0.7%
마장로 63
 
0.7%
2층 59
 
0.6%
길주남로 56
 
0.6%
Other values (1774) 4847
52.1%
2024-01-28T14:40:30.213252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9320
 
17.5%
3205
 
6.0%
1 3090
 
5.8%
2716
 
5.1%
2008
 
3.8%
1825
 
3.4%
1793
 
3.4%
1774
 
3.3%
1767
 
3.3%
1765
 
3.3%
Other values (282) 24069
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29587
55.5%
Decimal Number 9680
 
18.2%
Space Separator 9320
 
17.5%
Other Punctuation 1720
 
3.2%
Close Punctuation 1304
 
2.4%
Open Punctuation 1304
 
2.4%
Dash Punctuation 296
 
0.6%
Uppercase Letter 104
 
0.2%
Math Symbol 12
 
< 0.1%
Letter Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3205
 
10.8%
2716
 
9.2%
2008
 
6.8%
1825
 
6.2%
1793
 
6.1%
1774
 
6.0%
1767
 
6.0%
1765
 
6.0%
1765
 
6.0%
1580
 
5.3%
Other values (244) 9389
31.7%
Uppercase Letter
ValueCountFrequency (%)
A 33
31.7%
B 30
28.8%
U 14
13.5%
C 9
 
8.7%
M 4
 
3.8%
S 3
 
2.9%
K 3
 
2.9%
D 3
 
2.9%
R 1
 
1.0%
E 1
 
1.0%
Other values (3) 3
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 3090
31.9%
2 1296
13.4%
0 1110
 
11.5%
3 928
 
9.6%
4 878
 
9.1%
6 559
 
5.8%
5 530
 
5.5%
7 493
 
5.1%
9 405
 
4.2%
8 391
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 1705
99.1%
. 9
 
0.5%
& 3
 
0.2%
@ 2
 
0.1%
# 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 11
91.7%
1
 
8.3%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Lowercase Letter
ValueCountFrequency (%)
b 1
50.0%
a 1
50.0%
Space Separator
ValueCountFrequency (%)
9320
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1304
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1304
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 296
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29587
55.5%
Common 23636
44.3%
Latin 109
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3205
 
10.8%
2716
 
9.2%
2008
 
6.8%
1825
 
6.2%
1793
 
6.1%
1774
 
6.0%
1767
 
6.0%
1765
 
6.0%
1765
 
6.0%
1580
 
5.3%
Other values (244) 9389
31.7%
Common
ValueCountFrequency (%)
9320
39.4%
1 3090
 
13.1%
, 1705
 
7.2%
) 1304
 
5.5%
( 1304
 
5.5%
2 1296
 
5.5%
0 1110
 
4.7%
3 928
 
3.9%
4 878
 
3.7%
6 559
 
2.4%
Other values (11) 2142
 
9.1%
Latin
ValueCountFrequency (%)
A 33
30.3%
B 30
27.5%
U 14
12.8%
C 9
 
8.3%
M 4
 
3.7%
S 3
 
2.8%
K 3
 
2.8%
D 3
 
2.8%
2
 
1.8%
1
 
0.9%
Other values (7) 7
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29587
55.5%
ASCII 23741
44.5%
Number Forms 3
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9320
39.3%
1 3090
 
13.0%
, 1705
 
7.2%
) 1304
 
5.5%
( 1304
 
5.5%
2 1296
 
5.5%
0 1110
 
4.7%
3 928
 
3.9%
4 878
 
3.7%
6 559
 
2.4%
Other values (25) 2247
 
9.5%
Hangul
ValueCountFrequency (%)
3205
 
10.8%
2716
 
9.2%
2008
 
6.8%
1825
 
6.2%
1793
 
6.1%
1774
 
6.0%
1767
 
6.0%
1765
 
6.0%
1765
 
6.0%
1580
 
5.3%
Other values (244) 9389
31.7%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct1317
Distinct (%)91.6%
Missing317
Missing (%)18.1%
Memory size13.8 KiB
2024-01-28T14:40:30.423543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.065369
Min length9

Characters and Unicode

Total characters17350
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1235 ?
Unique (%)85.9%

Sample

1st row032-201-7788
2nd row032-584-4971
3rd row032-517-0411
4th row032-502-5843
5th row032-518-7778
ValueCountFrequency (%)
032-511-4067 7
 
0.5%
032-501-2292 5
 
0.3%
032-501-4545 5
 
0.3%
032-515-6415 5
 
0.3%
032-523-3838 4
 
0.3%
032-528-6080 4
 
0.3%
032-504-1592 4
 
0.3%
032-516-5592 4
 
0.3%
032-528-4545 4
 
0.3%
070-7517-7876 3
 
0.2%
Other values (1307) 1393
96.9%
2024-01-28T14:40:30.735921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2873
16.6%
0 2726
15.7%
2 2603
15.0%
3 2205
12.7%
5 1907
11.0%
1 1189
6.9%
7 930
 
5.4%
8 840
 
4.8%
9 731
 
4.2%
6 703
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14477
83.4%
Dash Punctuation 2873
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2726
18.8%
2 2603
18.0%
3 2205
15.2%
5 1907
13.2%
1 1189
8.2%
7 930
 
6.4%
8 840
 
5.8%
9 731
 
5.0%
6 703
 
4.9%
4 643
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 2873
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17350
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2873
16.6%
0 2726
15.7%
2 2603
15.0%
3 2205
12.7%
5 1907
11.0%
1 1189
6.9%
7 930
 
5.4%
8 840
 
4.8%
9 731
 
4.2%
6 703
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17350
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2873
16.6%
0 2726
15.7%
2 2603
15.0%
3 2205
12.7%
5 1907
11.0%
1 1189
6.9%
7 930
 
5.4%
8 840
 
4.8%
9 731
 
4.2%
6 703
 
4.1%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.8 KiB
Minimum2023-05-02 00:00:00
Maximum2023-05-02 00:00:00
2024-01-28T14:40:30.837283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T14:40:30.910398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-28T14:40:28.412560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T14:40:30.969193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번할인율(퍼센트)
연번1.0000.227
할인율(퍼센트)0.2271.000
2024-01-28T14:40:31.034970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번할인율(퍼센트)
연번1.0000.071
할인율(퍼센트)0.0711.000

Missing values

2024-01-28T14:40:28.525065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T14:40:28.625959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호명할인율(퍼센트)주소전화번호기준일자
01아론모터스3% 할인인천광역시 부평구 가좌로96번길 56(십정동)032-201-77882023-05-02
12진영자동차공업사3% 할인인천광역시 부평구 가좌로96번길 60, A동(십정동)032-584-49712023-05-02
23인천금영3% 할인인천광역시 부평구 갈산로 2032-517-04112023-05-02
34대성상사3% 할인인천광역시 부평구 갈산로5번길 6<NA>2023-05-02
45마성네일3% 할인인천광역시 부평구 갈월동로 12, 1층(갈산동)<NA>2023-05-02
56쇼콜라클라우드3% 할인인천광역시 부평구 갈월동로 13-1032-502-58432023-05-02
67쌍칼국수86부평본점배달e음 5% 할인인천광역시 부평구 갈월동로 14, 1층(갈산동)032-518-77782023-05-02
78대한냉면고기친구부평본점배달e음 5% 할인인천광역시 부평구 갈월동로 14, 1층(갈산동)032-518-77782023-05-02
89구구마(GUGUMA)2% 할인인천광역시 부평구 갈월동로 17,1층(갈산동)<NA>2023-05-02
910기라성2% 할인인천광역시 부평구 갈월동로 40, 1층 105호, 110호(갈산동,한국상가동)032-552-14642023-05-02
연번상호명할인율(퍼센트)주소전화번호기준일자
17451746게이트맨 벽산열쇠2% 할인인천광역시 부평구 후정동로 8 (삼산동, 태동빌딩)<NA>2023-05-02
17461747덕수파스타 부평점배달e음 5% 할인인천광역시 부평구 후정동로 8, 1층(삼산동)<NA>2023-05-02
17471748연두밥상2% 할인인천광역시 부평구 후정동로25번길 13(삼산동)<NA>2023-05-02
17481749금손푸드2% 할인인천광역시 부평구 후정동로25번길 13(삼산동)032-529-19582023-05-02
17491750서가네메밀촌2% 할인인천광역시 부평구 후정동로25번길 27(삼산동)032-519-31572023-05-02
17501751수민헤어샵3% 할인인천광역시 부평구 후정동로25번길 4, 201호032-519-88802023-05-02
17511752바다애꽃(인천삼산점)2% 할인인천광역시 부평구 후정동로25번길 7(삼산동)<NA>2023-05-02
17521753구피샵 삼산점2% 할인인천광역시 부평구 후정동로25번길 7(삼산동)032-505-59662023-05-02
17531754피아노 스토리2% 할인인천광역시 부평구 후정동로47번길 4,1동 117호(삼산동, 삼보아파트)<NA>2023-05-02
17541755김스헤어3% 할인인천광역시 부평구 후정로 7, 209동 101호(삼산동, 부평삼산엠코)032-503-00452023-05-02