Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 500 |
Missing cells | 12 |
Missing cells (%) | 0.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.6 KiB |
Average record size in memory | 42.3 B |
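The percentage and per-record figures in this table follow from the raw counts. A minimal sketch of that arithmetic, using only the numbers reported above (the variable names are illustrative and do not come from the profiling tool):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 500
missing_cells = 12
total_size_kib = 20.6  # already rounded in the report

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells with no value, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# Average bytes per row (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.2f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Because the total size is itself a rounded value, the computed record size can differ from the reported 42.3 B in the last digit.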
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 2 |
Dataset
Description | Sample |
---|---|
Author | 레드테이블 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=644dc1d5-8693-46da-8203-0f7f13957690 |
CTY_NM has constant value "shanghai" | Constant |
RSTRNT_TEL_NO has 12 (2.4%) missing values | Missing |
OVSEA_RSTRNT_ID has unique values | Unique |
RSTRNT_NM has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:47:56.739278 |
---|---|
Analysis finished | 2023-12-10 09:47:59.244145 |
Duration | 2.5 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
OVSEA_RSTRNT_ID
Real number (ℝ)
UNIQUE
 
Distinct | 500 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 501247.87 |
Minimum | 500000 |
---|---|
Maximum | 502598 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 500000 |
---|---|
5-th percentile | 500155.5 |
Q1 | 500628 |
median | 501149 |
Q3 | 501890.5 |
95-th percentile | 502465.65 |
Maximum | 502598 |
Range | 2598 |
Interquartile range (IQR) | 1262.5 |
Descriptive statistics
Standard deviation | 734.62287 |
---|---|
Coefficient of variation (CV) | 0.001465588 |
Kurtosis | -1.1529263 |
Mean | 501247.87 |
Median Absolute Deviation (MAD) | 617 |
Skewness | 0.1272818 |
Sum | 2.5062393 × 10^8 |
Variance | 539670.77 |
Monotonicity | Strictly increasing |
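Several of the statistics above are derived from others in the same tables. The following sketch recomputes them from the reported figures as a consistency check. It assumes the conventional definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean), which may differ in small details from the tool's internals.

```python
# Figures copied from the OVSEA_RSTRNT_ID tables.
minimum, maximum = 500000, 502598
q1, q3 = 500628, 501890.5
mean = 501247.87
std_dev = 734.62287
n_observations = 500  # no missing values in this column

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std_dev / mean               # Coefficient of variation
variance = std_dev ** 2           # Variance is the squared standard deviation
total = mean * n_observations     # Sum, recovered from the mean

print(value_range, iqr)
print(f"{cv:.9f}", f"{variance:.2f}", f"{total:.4e}")
```

Small differences in the last digits are expected, since the inputs are already rounded in the report.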
Value | Count | Frequency (%) |
500000 | 1 | 0.2% |
501638 | 1 | 0.2% |
501699 | 1 | 0.2% |
501684 | 1 | 0.2% |
501683 | 1 | 0.2% |
501680 | 1 | 0.2% |
501678 | 1 | 0.2% |
501662 | 1 | 0.2% |
501653 | 1 | 0.2% |
501647 | 1 | 0.2% |
Other values (490) | 490 | 98.0% |
Value | Count | Frequency (%) |
500000 | 1 | |
500001 | 1 | |
500003 | 1 | |
500018 | 1 | |
500019 | 1 | |
500028 | 1 | |
500030 | 1 | |
500046 | 1 | |
500054 | 1 | |
500057 | 1 |
Value | Count | Frequency (%) |
502598 | 1 | |
502595 | 1 | |
502586 | 1 | |
502583 | 1 | |
502581 | 1 | |
502578 | 1 | |
502569 | 1 | |
502567 | 1 | |
502542 | 1 | |
502538 | 1 |
CTY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
shanghai |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | shanghai |
---|---|
2nd row | shanghai |
3rd row | shanghai |
4th row | shanghai |
5th row | shanghai |
Common Values
Value | Count | Frequency (%) |
shanghai | 500 |
RSTRNT_NM
Text
UNIQUE
 
Distinct | 500 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Length
Max length | 22 |
---|---|
Median length | 16 |
Mean length | 8.316 |
Min length | 2 |
Characters and Unicode
Total characters | 4158 |
---|---|
Distinct characters | 748 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 500 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 阿凡提大饭店 |
---|---|
2nd row | Sasha's萨莎 |
3rd row | 1221餐馆 |
4th row | Va Bene华万意意大利餐厅 |
5th row | Wagas 沃歌斯(中信泰富店) |
Value | Count | Frequency (%) |
阿凡提大饭店 | 1 | 0.2% |
避风塘(金玉兰 | 1 | 0.2% |
台园圆圆香云吞(徐汇店 | 1 | 0.2% |
新香园港式茶餐厅(乌鲁木齐中路店 | 1 | 0.2% |
华华川菜馆 | 1 | 0.2% |
苹果花园 | 1 | 0.2% |
新吉士酒楼(虹桥店 | 1 | 0.2% |
恒隆酒楼(春申店 | 1 | 0.2% |
小四川鱼庄(招远路总店 | 1 | 0.2% |
黑三娘(仙霞店 | 1 | 0.2% |
Other values (502) | 502 |
Most occurring characters
Value | Count | Frequency (%) |
店 | 320 | 7.7% |
( | 272 | 6.5% |
) | 272 | 6.5% |
路 | 90 | 2.2% |
酒 | 83 | 2.0% |
厅 | 68 | 1.6% |
餐 | 65 | 1.6% |
海 | 62 | 1.5% |
家 | 61 | 1.5% |
大 | 49 | 1.2% |
Other values (738) | 2816 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3454 | 83.1% |
Open Punctuation | 272 | 6.5% |
Close Punctuation | 272 | 6.5% |
Lowercase Letter | 82 | 2.0% |
Uppercase Letter | 51 | 1.2% |
Space Separator | 12 | 0.3% |
Decimal Number | 10 | 0.2% |
Other Punctuation | 4 | 0.1% |
Dash Punctuation | 1 | < 0.1% |
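The category names in this table correspond to Unicode general categories (for example, Other Letter is `Lo`, Open Punctuation is `Ps`, Space Separator is `Zs`). The sketch below counts them for one of the sample names with Python's standard `unicodedata` module. It is an illustration of the classification, not necessarily the code path the profiling tool uses.

```python
import unicodedata
from collections import Counter

# One of the sample restaurant names listed for RSTRNT_NM.
name = "Wagas 沃歌斯(中信泰富店)"

# Map each character to its two-letter Unicode general category,
# e.g. "Lo" (Other Letter), "Lu" (Uppercase Letter), "Ps" (Open Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in name)

for category, count in category_counts.most_common():
    print(category, count)
```

Han characters fall under `Lo`, which is why Other Letter dominates the column-wide counts.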
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
店 | 320 | 9.3% |
路 | 90 | 2.6% |
酒 | 83 | 2.4% |
厅 | 68 | 2.0% |
餐 | 65 | 1.9% |
海 | 62 | 1.8% |
家 | 61 | 1.8% |
大 | 49 | 1.4% |
南 | 43 | 1.2% |
小 | 43 | 1.2% |
Other values (685) | 2570 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 15 | 18.3% |
e | 12 | 14.6% |
i | 9 | 11.0% |
s | 8 | 9.8% |
n | 6 | 7.3% |
c | 4 | 4.9% |
l | 4 | 4.9% |
o | 3 | 3.7% |
h | 3 | 3.7% |
t | 3 | 3.7% |
Other values (12) | 15 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 7 | 13.7% |
T | 4 | 7.8% |
O | 4 | 7.8% |
D | 3 | 5.9% |
S | 3 | 5.9% |
A | 3 | 5.9% |
N | 3 | 5.9% |
K | 3 | 5.9% |
W | 3 | 5.9% |
L | 2 | 3.9% |
Other values (11) | 16 |
Decimal Number
Value | Count | Frequency (%) |
5 | 3 | 30.0% |
0 | 2 | 20.0% |
2 | 2 | 20.0% |
1 | 2 | 20.0% |
8 | 1 | 10.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 272 |
Close Punctuation
Value | Count | Frequency (%) |
) | 272 |
Space Separator
Value | Count | Frequency (%) |
(space) | 12 |
Other Punctuation
Value | Count | Frequency (%) |
' | 4 |
Dash Punctuation
Value | Count | Frequency (%) |
— | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 3454 | 83.1% |
Common | 571 | 13.7% |
Latin | 133 | 3.2% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
店 | 320 | 9.3% |
路 | 90 | 2.6% |
酒 | 83 | 2.4% |
厅 | 68 | 2.0% |
餐 | 65 | 1.9% |
海 | 62 | 1.8% |
家 | 61 | 1.8% |
大 | 49 | 1.4% |
南 | 43 | 1.2% |
小 | 43 | 1.2% |
Other values (685) | 2570 |
Latin
Value | Count | Frequency (%) |
a | 15 | 11.3% |
e | 12 | 9.0% |
i | 9 | 6.8% |
s | 8 | 6.0% |
C | 7 | 5.3% |
n | 6 | 4.5% |
c | 4 | 3.0% |
T | 4 | 3.0% |
O | 4 | 3.0% |
l | 4 | 3.0% |
Other values (33) | 60 |
Common
Value | Count | Frequency (%) |
( | 272 | 47.6% |
) | 272 | 47.6% |
(space) | 12 | 2.1% |
' | 4 | 0.7% |
5 | 3 | 0.5% |
0 | 2 | 0.4% |
2 | 2 | 0.4% |
1 | 2 | 0.4% |
8 | 1 | 0.2% |
— | 1 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 3454 | 83.1% |
ASCII | 703 | 16.9% |
Punctuation | 1 | < 0.1% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
店 | 320 | 9.3% |
路 | 90 | 2.6% |
酒 | 83 | 2.4% |
厅 | 68 | 2.0% |
餐 | 65 | 1.9% |
海 | 62 | 1.8% |
家 | 61 | 1.8% |
大 | 49 | 1.4% |
南 | 43 | 1.2% |
小 | 43 | 1.2% |
Other values (685) | 2570 |
ASCII
Value | Count | Frequency (%) |
( | 272 | 38.7% |
) | 272 | 38.7% |
a | 15 | 2.1% |
e | 12 | 1.7% |
(space) | 12 | 1.7% |
i | 9 | 1.3% |
s | 8 | 1.1% |
C | 7 | 1.0% |
n | 6 | 0.9% |
' | 4 | 0.6% |
Other values (42) | 86 | 12.2% |
Punctuation
Value | Count | Frequency (%) |
— | 1 |
RSTRNT_TEL_NO
Real number (ℝ)
MISSING
 
Distinct | 480 |
---|---|
Distinct (%) | 98.4% |
Missing | 12 |
Missing (%) | 2.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.5137406 × 10^10 |
Minimum | 2.1223518 × 10^9 |
---|---|
Maximum | 6.8868889 × 10^11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 2.1223518 × 10^9 |
---|---|
5-th percentile | 2.1529264 × 10^9 |
Q1 | 2.1621781 × 10^9 |
median | 2.1632305 × 10^9 |
Q3 | 2.1646594 × 10^9 |
95-th percentile | 5.0471235 × 10^11 |
Maximum | 6.8868889 × 10^11 |
Range | 6.8656654 × 10^11 |
Interquartile range (IQR) | 2481305 |
Descriptive statistics
Standard deviation | 1.3610182 × 10^11 |
---|---|
Coefficient of variation (CV) | 3.8734168 |
Kurtosis | 14.035169 |
Mean | 3.5137406 × 10^10 |
Median Absolute Deviation (MAD) | 1160998.5 |
Skewness | 3.9727987 |
Sum | 1.7147054 × 10^13 |
Variance | 1.8523706 × 10^22 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2153965000 | 3 | 0.6% |
4008209777 | 3 | 0.6% |
4001917917 | 2 | 0.4% |
2163739458 | 2 | 0.4% |
2163403067 | 2 | 0.4% |
504712348908 | 2 | 0.4% |
2154370078 | 1 | 0.2% |
2162293377 | 1 | 0.2% |
2164018048 | 1 | 0.2% |
2164012583 | 1 | 0.2% |
Other values (470) | 470 | 94.0% |
(Missing) | 12 | 2.4% |
Value | Count | Frequency (%) |
2122351753 | 1 | |
2132070213 | 1 | |
2133134888 | 1 | |
2150306659 | 1 | |
2150471234 | 1 | |
2150471266 | 1 | |
2150471917 | 1 | |
2150478838 | 1 | |
2150490703 | 1 | |
2150714876 | 1 |
Value | Count | Frequency (%) |
688688888789 | 1 | |
688688888728 | 1 | |
641555882756 | 1 | |
641555882431 | 1 | |
641511115217 | 1 | |
641511115216 | 1 | |
641511115212 | 1 | |
640665188608 | 1 | |
633518887368 | 1 | |
627588884814 | 1 |
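RSTRNT_TEL_NO holds telephone numbers but was profiled as a numeric column, so leading zeros are lost and statistics such as the mean or variance carry no real meaning. The sketch below buckets values by digit pattern instead. The function name and the bucket rules are assumptions made for illustration (10 digits starting with 21 read as a Shanghai landline whose area-code 0 was dropped, 10 digits starting with 400 read as a service number, 11 digits starting with 1 read as a mobile number); the dataset itself does not document its encoding, and the 12-digit values seen above are left unclassified.

```python
def classify_phone(value):
    """Roughly bucket a RSTRNT_TEL_NO value by its digit pattern.

    Hypothetical helper: the rules below are illustrative assumptions,
    not something stated by the dataset or the profiling report.
    """
    if value is None:
        return "missing"
    digits = str(int(value))
    if len(digits) == 10 and digits.startswith("21"):
        # Assumed: landline with area code 021, leading zero lost
        return "landline_021"
    if len(digits) == 10 and digits.startswith("400"):
        # Assumed: 400-prefixed service number
        return "service_400"
    if len(digits) == 11 and digits.startswith("1"):
        # Assumed: mobile number
        return "mobile"
    return "unrecognized"


# Values taken from the tables and sample rows of this report.
samples = [2165559604, 4008209777, 13391285089, 504712348908, None]
labels = [classify_phone(v) for v in samples]
print(labels)
```

Loading the column as text rather than as a number would avoid the lost leading zeros in the first place.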
RSTRNT_ADDR
Text
Distinct | 481 |
---|---|
Distinct (%) | 96.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Length
Max length | 46 |
---|---|
Median length | 37 |
Mean length | 20.3 |
Min length | 5 |
Characters and Unicode
Total characters | 10150 |
---|---|
Distinct characters | 545 |
Distinct categories | 8 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 473 |
---|---|
Unique (%) | 94.6% |
Sample
1st row | 虹口区曲阳路775号天山宾馆B1楼 |
---|---|
2nd row | 徐汇区东平路11号(近衡山路) |
3rd row | 长宁区延安西路1221号(近番禺路) |
4th row | 卢湾区太仓路181弄新天地北里7号楼2楼(近马当路) |
5th row | 静安区南京西路1168号中信泰富B1楼(近陕西北路) |
Value | Count | Frequency (%) |
多家连锁店 | 11 | 2.2% |
浦东新区世纪大道88号金茂君悦大酒店56楼(近二号线陆家嘴站 | 3 | 0.6% |
静安区威海路500号四季酒店2楼(近石门一路 | 3 | 0.6% |
徐汇区汾阳路150号(近桃江路 | 2 | 0.4% |
静安区石门二路19号(近南京西路 | 2 | 0.4% |
徐汇区天平路220号(近康平路 | 2 | 0.4% |
黄浦区豫园路98号(近绿波廊 | 2 | 0.4% |
黄浦区九江路555号王宝和大酒店2楼(近福建中路 | 2 | 0.4% |
(blank) | 2 | 0.4% |
闵行区虹梅路3293号(近延安西路 | 1 | 0.2% |
Other values (472) | 472 |
Most occurring characters
Value | Count | Frequency (%) |
路 | 938 | 9.2% |
号 | 512 | 5.0% |
区 | 496 | 4.9% |
) | 482 | 4.7% |
( | 482 | 4.7% |
近 | 404 | 4.0% |
1 | 373 | 3.7% |
2 | 209 | 2.1% |
8 | 202 | 2.0% |
5 | 182 | 1.8% |
Other values (535) | 5870 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7345 | 72.4% |
Decimal Number | 1755 | 17.3% |
Close Punctuation | 485 | 4.8% |
Open Punctuation | 485 | 4.8% |
Dash Punctuation | 33 | 0.3% |
Uppercase Letter | 28 | 0.3% |
Other Punctuation | 17 | 0.2% |
Space Separator | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
路 | 938 | 12.8% |
号 | 512 | 7.0% |
区 | 496 | 6.8% |
近 | 404 | 5.5% |
浦 | 160 | 2.2% |
东 | 155 | 2.1% |
楼 | 154 | 2.1% |
南 | 135 | 1.8% |
安 | 114 | 1.6% |
汇 | 109 | 1.5% |
Other values (504) | 4168 |
Decimal Number
Value | Count | Frequency (%) |
1 | 373 | 21.3% |
2 | 209 | 11.9% |
8 | 202 | 11.5% |
5 | 182 | 10.4% |
3 | 169 | 9.6% |
0 | 168 | 9.6% |
4 | 119 | 6.8% |
7 | 112 | 6.4% |
6 | 111 | 6.3% |
9 | 110 | 6.3% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 12 | 42.9% |
A | 5 | 17.9% |
H | 2 | 7.1% |
O | 2 | 7.1% |
S | 2 | 7.1% |
E | 2 | 7.1% |
T | 1 | 3.6% |
F | 1 | 3.6% |
C | 1 | 3.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 7 | |
, | 4 | |
、 | 3 | |
/ | 1 | 5.9% |
: | 1 | 5.9% |
。 | 1 | 5.9% |
Close Punctuation
Value | Count | Frequency (%) |
) | 482 | |
) | 3 | 0.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 482 | |
( | 3 | 0.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 33 |
Space Separator
Value | Count | Frequency (%) |
(space) | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 7345 | 72.4% |
Common | 2777 | 27.4% |
Latin | 28 | 0.3% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
路 | 938 | 12.8% |
号 | 512 | 7.0% |
区 | 496 | 6.8% |
近 | 404 | 5.5% |
浦 | 160 | 2.2% |
东 | 155 | 2.1% |
楼 | 154 | 2.1% |
南 | 135 | 1.8% |
安 | 114 | 1.6% |
汇 | 109 | 1.5% |
Other values (504) | 4168 |
Common
Value | Count | Frequency (%) |
) | 482 | 17.4% |
( | 482 | 17.4% |
1 | 373 | 13.4% |
2 | 209 | 7.5% |
8 | 202 | 7.3% |
5 | 182 | 6.6% |
3 | 169 | 6.1% |
0 | 168 | 6.0% |
4 | 119 | 4.3% |
7 | 112 | 4.0% |
Other values (12) | 279 |
Latin
Value | Count | Frequency (%) |
B | 12 | 42.9% |
A | 5 | 17.9% |
H | 2 | 7.1% |
O | 2 | 7.1% |
S | 2 | 7.1% |
E | 2 | 7.1% |
T | 1 | 3.6% |
F | 1 | 3.6% |
C | 1 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 7345 | 72.4% |
ASCII | 2791 | 27.5% |
None | 14 | 0.1% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
路 | 938 | 12.8% |
号 | 512 | 7.0% |
区 | 496 | 6.8% |
近 | 404 | 5.5% |
浦 | 160 | 2.2% |
东 | 155 | 2.1% |
楼 | 154 | 2.1% |
南 | 135 | 1.8% |
安 | 114 | 1.6% |
汇 | 109 | 1.5% |
Other values (504) | 4168 |
ASCII
Value | Count | Frequency (%) |
) | 482 | 17.3% |
( | 482 | 17.3% |
1 | 373 | 13.4% |
2 | 209 | 7.5% |
8 | 202 | 7.2% |
5 | 182 | 6.5% |
3 | 169 | 6.1% |
0 | 168 | 6.0% |
4 | 119 | 4.3% |
7 | 112 | 4.0% |
Other values (16) | 293 |
None
Value | Count | Frequency (%) |
, | 4 | |
) | 3 | |
、 | 3 | |
( | 3 | |
。 | 1 | 7.1% |
 | OVSEA_RSTRNT_ID | RSTRNT_TEL_NO |
---|---|---|
OVSEA_RSTRNT_ID | 1.000 | 0.161 |
RSTRNT_TEL_NO | 0.161 | 1.000 |
 | OVSEA_RSTRNT_ID | RSTRNT_TEL_NO |
---|---|---|
OVSEA_RSTRNT_ID | 1.000 | -0.055 |
RSTRNT_TEL_NO | -0.055 | 1.000 |
 | OVSEA_RSTRNT_ID | CTY_NM | RSTRNT_NM | RSTRNT_TEL_NO | RSTRNT_ADDR |
---|---|---|---|---|---|
0 | 500000 | shanghai | 阿凡提大饭店 | 2165559604 | 虹口区曲阳路775号天山宾馆B1楼 |
1 | 500001 | shanghai | Sasha's萨莎 | 2164746628 | 徐汇区东平路11号(近衡山路) |
2 | 500003 | shanghai | 1221餐馆 | 2162132441 | 长宁区延安西路1221号(近番禺路) |
3 | 500018 | shanghai | Va Bene华万意意大利餐厅 | 2163112211 | 卢湾区太仓路181弄新天地北里7号楼2楼(近马当路) |
4 | 500019 | shanghai | Wagas 沃歌斯(中信泰富店) | 2152925228 | 静安区南京西路1168号中信泰富B1楼(近陕西北路) |
5 | 500028 | shanghai | 阿山饭店 | 2162686583 | 长宁区虹桥路2378号(近动物园) |
6 | 500030 | shanghai | 艾迪多慕思 | 2162488499 | 静安区延安西路200号文艺宾馆新楼1楼(近乌鲁木齐北路) |
7 | 500046 | shanghai | 白家餐厅 | 2164376915 | 徐汇区宛平路189弄12号(近衡山路) |
8 | 500054 | shanghai | 半岛鱼翅海鲜 | 2164189393 | 徐汇区零陵路518号长航宾馆1-2楼(近东安路) |
9 | 500057 | shanghai | 宝莱纳餐厅(新天地店) | 2163203935 | 卢湾区太仓路181弄新天地北里19-20号(近马当路) |
 | OVSEA_RSTRNT_ID | CTY_NM | RSTRNT_NM | RSTRNT_TEL_NO | RSTRNT_ADDR |
---|---|---|---|---|---|
490 | 502538 | shanghai | 申申面包房(复兴西路店) | 2164373493 | 徐汇区复兴西路8号(近淮海中路) |
491 | 502542 | shanghai | 上雅铁板烧(古北店) | 2162569390 | 徐汇区黄金城道851号 |
492 | 502567 | shanghai | 荣日本料理(兴义店) | 2162789778 | 长宁区兴义路48号新世纪广场B座1楼(近娄山关路) |
493 | 502569 | shanghai | 大娘水饺(天钥桥路店) | 2164649381 | 徐汇区天钥桥路57号(近肇嘉浜路) |
494 | 502578 | shanghai | 老姜烧烤 | 13391285089 | 浦东新区东方路蓝村路车站(东方路蓝村路) |
495 | 502581 | shanghai | 百味瓦罐煨汤(江西北路店) | 2163574841 | 虹口区江西北路208号(近七浦路) |
496 | 502583 | shanghai | 美新点心店 | 2162470030 | 静安区陕西北路105号(近威海路) |
497 | 502586 | shanghai | 山梁桂林米粉(仙霞路店) | 13764527052 | 长宁区仙霞路179号(近娄山关路) |
498 | 502595 | shanghai | 面包新语(美罗城店) | 2164267307 | 徐汇区肇嘉浜路1111号美罗城1-27店铺(近漕溪北路,东方商厦,港汇广场,六百,汇金百货) |
499 | 502598 | shanghai | 德兴面馆(福建中路店) | 2163602866 | 黄浦区福建中路529号(近北京东路) |