Overview

Dataset statistics

Number of variables5
Number of observations617
Missing cells224
Missing cells (%)7.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.0 KiB
Average record size in memory43.2 B

Variable types

Numeric3
Text2

Dataset

Description서울특별시 성동구 공공배달앱 배달특급 가맹점 현황으로 매장명, 사업자번호, 전화번호, 주소 등의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15117657/fileData.do

Alerts

전화번호 has 224 (36.3%) missing valuesMissing
순번 has unique valuesUnique
사업자번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:02:22.653115
Analysis finished2023-12-12 14:02:24.039063
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct617
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean309
Minimum1
Maximum617
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-12T23:02:24.110124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile31.8
Q1155
median309
Q3463
95-th percentile586.2
Maximum617
Range616
Interquartile range (IQR)308

Descriptive statistics

Standard deviation178.25684
Coefficient of variation (CV)0.57688297
Kurtosis-1.2
Mean309
Median Absolute Deviation (MAD)154
Skewness0
Sum190653
Variance31775.5
MonotonicityStrictly increasing
2023-12-12T23:02:24.249456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
415 1
 
0.2%
408 1
 
0.2%
409 1
 
0.2%
410 1
 
0.2%
411 1
 
0.2%
412 1
 
0.2%
413 1
 
0.2%
414 1
 
0.2%
416 1
 
0.2%
Other values (607) 607
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
617 1
0.2%
616 1
0.2%
615 1
0.2%
614 1
0.2%
613 1
0.2%
612 1
0.2%
611 1
0.2%
610 1
0.2%
609 1
0.2%
608 1
0.2%
Distinct616
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T23:02:24.512956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length8.4230146
Min length2

Characters and Unicode

Total characters5197
Distinct characters559
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique615 ?
Unique (%)99.7%

Sample

1st row케밥빙
2nd row나따오비까with허유산
3rd rowohoh불독(성수점)
4th row라사천 마라탕 왕십리점
5th row카페희다 왕십리센트라스점
ValueCountFrequency (%)
왕십리점 33
 
3.6%
한양대점 19
 
2.1%
성수점 12
 
1.3%
금호점 11
 
1.2%
행당점 9
 
1.0%
뚝도시장 7
 
0.8%
성수역점 6
 
0.7%
카페 6
 
0.7%
본점 5
 
0.5%
상왕십리점 4
 
0.4%
Other values (757) 798
87.7%
2023-12-12T23:02:24.969415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
306
 
5.9%
293
 
5.6%
135
 
2.6%
( 130
 
2.5%
) 130
 
2.5%
95
 
1.8%
95
 
1.8%
93
 
1.8%
90
 
1.7%
85
 
1.6%
Other values (549) 3745
72.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4332
83.4%
Space Separator 293
 
5.6%
Uppercase Letter 173
 
3.3%
Open Punctuation 130
 
2.5%
Close Punctuation 130
 
2.5%
Decimal Number 100
 
1.9%
Lowercase Letter 20
 
0.4%
Other Punctuation 19
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
306
 
7.1%
135
 
3.1%
95
 
2.2%
95
 
2.2%
93
 
2.1%
90
 
2.1%
85
 
2.0%
83
 
1.9%
69
 
1.6%
63
 
1.5%
Other values (508) 3218
74.3%
Uppercase Letter
ValueCountFrequency (%)
C 46
26.6%
U 44
25.4%
S 28
16.2%
G 26
15.0%
B 8
 
4.6%
Q 3
 
1.7%
I 3
 
1.7%
T 3
 
1.7%
O 3
 
1.7%
K 3
 
1.7%
Other values (5) 6
 
3.5%
Decimal Number
ValueCountFrequency (%)
2 32
32.0%
5 31
31.0%
1 14
14.0%
7 5
 
5.0%
6 5
 
5.0%
0 4
 
4.0%
9 4
 
4.0%
3 3
 
3.0%
4 1
 
1.0%
8 1
 
1.0%
Lowercase Letter
ValueCountFrequency (%)
n 5
25.0%
o 4
20.0%
h 3
15.0%
b 2
 
10.0%
t 2
 
10.0%
w 1
 
5.0%
i 1
 
5.0%
m 1
 
5.0%
c 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
& 13
68.4%
. 3
 
15.8%
, 2
 
10.5%
? 1
 
5.3%
Space Separator
ValueCountFrequency (%)
293
100.0%
Open Punctuation
ValueCountFrequency (%)
( 130
100.0%
Close Punctuation
ValueCountFrequency (%)
) 130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4332
83.4%
Common 672
 
12.9%
Latin 193
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
306
 
7.1%
135
 
3.1%
95
 
2.2%
95
 
2.2%
93
 
2.1%
90
 
2.1%
85
 
2.0%
83
 
1.9%
69
 
1.6%
63
 
1.5%
Other values (508) 3218
74.3%
Latin
ValueCountFrequency (%)
C 46
23.8%
U 44
22.8%
S 28
14.5%
G 26
13.5%
B 8
 
4.1%
n 5
 
2.6%
o 4
 
2.1%
h 3
 
1.6%
Q 3
 
1.6%
I 3
 
1.6%
Other values (14) 23
11.9%
Common
ValueCountFrequency (%)
293
43.6%
( 130
19.3%
) 130
19.3%
2 32
 
4.8%
5 31
 
4.6%
1 14
 
2.1%
& 13
 
1.9%
7 5
 
0.7%
6 5
 
0.7%
0 4
 
0.6%
Other values (7) 15
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4332
83.4%
ASCII 865
 
16.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
306
 
7.1%
135
 
3.1%
95
 
2.2%
95
 
2.2%
93
 
2.1%
90
 
2.1%
85
 
2.0%
83
 
1.9%
69
 
1.6%
63
 
1.5%
Other values (508) 3218
74.3%
ASCII
ValueCountFrequency (%)
293
33.9%
( 130
15.0%
) 130
15.0%
C 46
 
5.3%
U 44
 
5.1%
2 32
 
3.7%
5 31
 
3.6%
S 28
 
3.2%
G 26
 
3.0%
1 14
 
1.6%
Other values (31) 91
 
10.5%

사업자번호
Real number (ℝ)

UNIQUE 

Distinct617
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.445385 × 109
Minimum1.0103565 × 109
Maximum8.9609015 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-12T23:02:25.126352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0103565 × 109
5-th percentile1.3399444 × 109
Q12.0632136 × 109
median4.1440004 × 109
Q36.6409005 × 109
95-th percentile8.5537211 × 109
Maximum8.9609015 × 109
Range7.950545 × 109
Interquartile range (IQR)4.5776869 × 109

Descriptive statistics

Standard deviation2.4289801 × 109
Coefficient of variation (CV)0.5464049
Kurtosis-1.2404571
Mean4.445385 × 109
Median Absolute Deviation (MAD)2.0808882 × 109
Skewness0.34236666
Sum2.7428025 × 1012
Variance5.8999445 × 1018
MonotonicityNot monotonic
2023-12-12T23:02:25.328667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2063202704 1
 
0.2%
2061623131 1
 
0.2%
8105600592 1
 
0.2%
6320702783 1
 
0.2%
3366000806 1
 
0.2%
3515100439 1
 
0.2%
3935800790 1
 
0.2%
2061931651 1
 
0.2%
3700702658 1
 
0.2%
5804100659 1
 
0.2%
Other values (607) 607
98.4%
ValueCountFrequency (%)
1010356467 1
0.2%
1010628716 1
0.2%
1010629807 1
0.2%
1011393379 1
0.2%
1013209135 1
0.2%
1013235201 1
0.2%
1013338791 1
0.2%
1021952015 1
0.2%
1040343879 1
0.2%
1040517166 1
0.2%
ValueCountFrequency (%)
8960901489 1
0.2%
8951102080 1
0.2%
8950100268 1
0.2%
8920100861 1
0.2%
8918700085 1
0.2%
8895400333 1
0.2%
8874800170 1
0.2%
8871801361 1
0.2%
8871401174 1
0.2%
8870402209 1
0.2%

전화번호
Real number (ℝ)

MISSING 

Distinct390
Distinct (%)99.2%
Missing224
Missing (%)36.3%
Infinite0
Infinite (%)0.0%
Mean9.7265272 × 108
Minimum24523574
Maximum5.0714969 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-12T23:02:25.804355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum24523574
5-th percentile24642543
Q12.2235639 × 108
median2.2293729 × 108
Q32.2299497 × 108
95-th percentile7.0773995 × 109
Maximum5.0714969 × 1010
Range5.0690445 × 1010
Interquartile range (IQR)638577

Descriptive statistics

Standard deviation4.0050206 × 109
Coefficient of variation (CV)4.1176265
Kurtosis120.6651
Mean9.7265272 × 108
Median Absolute Deviation (MAD)482740
Skewness10.088893
Sum3.8225252 × 1011
Variance1.604019 × 1019
MonotonicityNot monotonic
2023-12-12T23:02:25.962106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
222916565 2
 
0.3%
222483405 2
 
0.3%
222976272 2
 
0.3%
222953255 1
 
0.2%
222953091 1
 
0.2%
222934282 1
 
0.2%
222937292 1
 
0.2%
222938458 1
 
0.2%
222940077 1
 
0.2%
222940110 1
 
0.2%
Other values (380) 380
61.6%
(Missing) 224
36.3%
ValueCountFrequency (%)
24523574 1
0.2%
24555066 1
0.2%
24606990 1
0.2%
24610706 1
0.2%
24612395 1
0.2%
24617220 1
0.2%
24619283 1
0.2%
24619397 1
0.2%
24620700 1
0.2%
24623377 1
0.2%
ValueCountFrequency (%)
50714968882 1
0.2%
50713651114 1
0.2%
7088878512 1
0.2%
7088661688 1
0.2%
7088051218 1
0.2%
7088025100 1
0.2%
7088003759 1
0.2%
7086714208 1
0.2%
7086481289 1
0.2%
7086239282 1
0.2%

주소
Text

Distinct584
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T23:02:26.298091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length44
Mean length25.12966
Min length19

Characters and Unicode

Total characters15505
Distinct characters314
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique555 ?
Unique (%)90.0%

Sample

1st row서울특별시 성동구 왕십리광장로 17 4층
2nd row서울특별시 성동구 왕십리광장로 17 3층
3rd row서울특별시 성동구 성덕정길 68 1층2호
4th row서울특별시 성동구 무학봉28길 4-1 지하2층 B201호
5th row서울특별시 성동구 왕십리로 410 I동 127호
ValueCountFrequency (%)
서울특별시 617
18.6%
성동구 617
18.6%
1층 412
 
12.4%
왕십리로 41
 
1.2%
마장로 36
 
1.1%
2층 32
 
1.0%
마조로 26
 
0.8%
행당로 25
 
0.8%
17 20
 
0.6%
독서당로 20
 
0.6%
Other values (710) 1479
44.5%
2023-12-12T23:02:26.878426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2712
17.5%
1 1172
 
7.6%
825
 
5.3%
707
 
4.6%
652
 
4.2%
629
 
4.1%
623
 
4.0%
621
 
4.0%
617
 
4.0%
617
 
4.0%
Other values (304) 6330
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9348
60.3%
Decimal Number 2896
 
18.7%
Space Separator 2712
 
17.5%
Open Punctuation 147
 
0.9%
Close Punctuation 147
 
0.9%
Dash Punctuation 139
 
0.9%
Uppercase Letter 62
 
0.4%
Other Punctuation 34
 
0.2%
Lowercase Letter 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
825
 
8.8%
707
 
7.6%
652
 
7.0%
629
 
6.7%
623
 
6.7%
621
 
6.6%
617
 
6.6%
617
 
6.6%
504
 
5.4%
400
 
4.3%
Other values (265) 3153
33.7%
Uppercase Letter
ValueCountFrequency (%)
C 20
32.3%
U 18
29.0%
B 6
 
9.7%
I 5
 
8.1%
J 3
 
4.8%
A 2
 
3.2%
P 1
 
1.6%
S 1
 
1.6%
K 1
 
1.6%
T 1
 
1.6%
Other values (4) 4
 
6.5%
Decimal Number
ValueCountFrequency (%)
1 1172
40.5%
2 393
 
13.6%
3 236
 
8.1%
0 200
 
6.9%
4 183
 
6.3%
5 180
 
6.2%
7 169
 
5.8%
6 145
 
5.0%
8 123
 
4.2%
9 95
 
3.3%
Lowercase Letter
ValueCountFrequency (%)
e 5
25.0%
r 3
15.0%
b 3
15.0%
c 2
 
10.0%
m 2
 
10.0%
i 2
 
10.0%
k 1
 
5.0%
u 1
 
5.0%
j 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 32
94.1%
. 2
 
5.9%
Space Separator
ValueCountFrequency (%)
2712
100.0%
Open Punctuation
ValueCountFrequency (%)
( 147
100.0%
Close Punctuation
ValueCountFrequency (%)
) 147
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 139
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9348
60.3%
Common 6075
39.2%
Latin 82
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
825
 
8.8%
707
 
7.6%
652
 
7.0%
629
 
6.7%
623
 
6.7%
621
 
6.6%
617
 
6.6%
617
 
6.6%
504
 
5.4%
400
 
4.3%
Other values (265) 3153
33.7%
Latin
ValueCountFrequency (%)
C 20
24.4%
U 18
22.0%
B 6
 
7.3%
e 5
 
6.1%
I 5
 
6.1%
r 3
 
3.7%
b 3
 
3.7%
J 3
 
3.7%
c 2
 
2.4%
m 2
 
2.4%
Other values (13) 15
18.3%
Common
ValueCountFrequency (%)
2712
44.6%
1 1172
19.3%
2 393
 
6.5%
3 236
 
3.9%
0 200
 
3.3%
4 183
 
3.0%
5 180
 
3.0%
7 169
 
2.8%
( 147
 
2.4%
) 147
 
2.4%
Other values (6) 536
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9347
60.3%
ASCII 6157
39.7%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2712
44.0%
1 1172
19.0%
2 393
 
6.4%
3 236
 
3.8%
0 200
 
3.2%
4 183
 
3.0%
5 180
 
2.9%
7 169
 
2.7%
( 147
 
2.4%
) 147
 
2.4%
Other values (29) 618
 
10.0%
Hangul
ValueCountFrequency (%)
825
 
8.8%
707
 
7.6%
652
 
7.0%
629
 
6.7%
623
 
6.7%
621
 
6.6%
617
 
6.6%
617
 
6.6%
504
 
5.4%
400
 
4.3%
Other values (264) 3152
33.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T23:02:23.648131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.109631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.389869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.736609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.205055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.474993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.821429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.308186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:23.564425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:02:26.985512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업자번호전화번호
순번1.0000.0000.583
사업자번호0.0001.0000.000
전화번호0.5830.0001.000
2023-12-12T23:02:27.086340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업자번호전화번호
순번1.0000.001-0.462
사업자번호0.0011.0000.014
전화번호-0.4620.0141.000

Missing values

2023-12-12T23:02:23.920993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:02:24.004862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번매장명사업자번호전화번호주소
01케밥빙20632027047088878512서울특별시 성동구 왕십리광장로 17 4층
12나따오비까with허유산75305021987088661688서울특별시 성동구 왕십리광장로 17 3층
23ohoh불독(성수점)47154004667088051218서울특별시 성동구 성덕정길 68 1층2호
34라사천 마라탕 왕십리점22125518707088025100서울특별시 성동구 무학봉28길 4-1 지하2층 B201호
45카페희다 왕십리센트라스점49618015637088003759서울특별시 성동구 왕십리로 410 I동 127호
56호말커피62659006257086714208서울특별시 성동구 마조로 15-16 1층
67르미르미38210020977086481289서울특별시 성동구 무학봉15길 17-1 1층 르미르미 (remereme)
78히동이치킨 왕십리점51011510177086239282서울특별시 성동구 행당로17길 42 1층
89성수시루35809020797082998257서울특별시 성동구 성수일로8길 42 1층
910쉭앤칙 샌드위치샐러드 본점21517851427082821314서울특별시 성동구 아차산로13길 37 103호
순번매장명사업자번호전화번호주소
607608오늘먹고싶은족발2063173992<NA>서울특별시 성동구 한림말1길 16 (옥수동) 오늘먹고싶은족발
608609금호다방6121451084<NA>서울특별시 성동구 금호산2길 22-10 1층 금호다방
609610철순이네김치찌개(왕십리점)8662800627<NA>서울특별시 성동구 고산자로 290-10 (행당동) 1층
610611한스케익(왕십리점)1372301009<NA>서울특별시 성동구 왕십리로 390 (상왕십리동) 1층 104호
611612바르다김선생(행당점)2171040249<NA>서울특별시 성동구 행당로 103 1층
612613바르다김선생(서울숲점)6021262673<NA>서울특별시 성동구 서울숲2길 32-14 갤러리아포레 1층 117호
613614셀렉토커피(왕십리점)5670100567<NA>서울특별시 성동구 왕십리로 315 (행당동) 한동타워 1층
614615셀렉토커피(서울숲IT캐슬점)2291602750<NA>서울특별시 성동구 광나루로 130 서울숲IT캐슬
615616카페베네(한양사이버대점)2068629161<NA>서울특별시 성동구 왕십리로 220 (행당동) 2관
616617설빙(서울 한양대점)2063181018<NA>서울특별시 성동구 마조로 9, 2층