Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 100 |
Missing cells | 71 |
Missing cells (%) | 5.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 9.9 KiB |
Average record size in memory | 101.3 B |
Variable types
Text | 5 |
---|---|
Categorical | 4 |
Numeric | 2 |
DateTime | 1 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=4d35b91f-03d3-4763-ac8d-9e60d88d6456 |
base_ymd has constant value "" | Constant |
city_gn_gu_cd is highly overall correlated with xpos_lo and 2 other fields | High correlation |
menu_pc is highly overall correlated with xpos_lo and 2 other fields | High correlation |
city_do_cd is highly overall correlated with xpos_lo and 4 other fields | High correlation |
area_nm is highly overall correlated with xpos_lo and 4 other fields | High correlation |
xpos_lo is highly overall correlated with ypos_la and 4 other fields | High correlation |
ypos_la is highly overall correlated with xpos_lo and 2 other fields | High correlation |
city_do_cd is highly imbalanced (80.6%) | Imbalance |
city_gn_gu_cd is highly imbalanced (82.3%) | Imbalance |
area_nm is highly imbalanced (80.6%) | Imbalance |
ypos_la has 3 (3.0%) missing values | Missing |
homepage_url has 65 (65.0%) missing values | Missing |
tel_no has 3 (3.0%) missing values | Missing |
entrp_nm has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:41:07.119914 |
---|---|
Analysis finished | 2023-12-10 09:41:10.940698 |
Duration | 3.82 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
entrp_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
리치몬드과자점 | 2 | 1.7% |
홍대점 | 2 | 1.7% |
쿄베이커리 | 2 | 1.7% |
남도마루 | 2 | 1.7% |
본점 | 2 | 1.7% |
고성막국수 | 1 | 0.9% |
baratie | 1 | 0.9% |
다미 | 1 | 0.9% |
송가 | 1 | 0.9% |
창고43 | 1 | 0.9% |
Other values (101) | 101 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 17 | 3.5% |
16 | 3.3% | |
점 | 14 | 2.8% |
리 | 12 | 2.4% |
원 | 8 | 1.6% |
마 | 8 | 1.6% |
동 | 7 | 1.4% |
집 | 7 | 1.4% |
산 | 7 | 1.4% |
당 | 7 | 1.4% |
Other values (207) | 389 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 458 | |
Space Separator | 16 | 3.3% |
Decimal Number | 8 | 1.6% |
Lowercase Letter | 8 | 1.6% |
Uppercase Letter | 2 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 17 | 3.7% |
점 | 14 | 3.1% |
리 | 12 | 2.6% |
원 | 8 | 1.7% |
마 | 8 | 1.7% |
동 | 7 | 1.5% |
집 | 7 | 1.5% |
산 | 7 | 1.5% |
당 | 7 | 1.5% |
치 | 6 | 1.3% |
Other values (191) | 365 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2 | |
t | 1 | |
i | 1 | |
h | 1 | |
r | 1 | |
n | 1 | |
e | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
8 | 2 | |
3 | 1 | |
4 | 1 | |
7 | 1 | |
9 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1 | |
A | 1 |
Space Separator
Value | Count | Frequency (%) |
16 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 458 | |
Common | 24 | 4.9% |
Latin | 10 | 2.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 17 | 3.7% |
점 | 14 | 3.1% |
리 | 12 | 2.6% |
원 | 8 | 1.7% |
마 | 8 | 1.7% |
동 | 7 | 1.5% |
집 | 7 | 1.5% |
산 | 7 | 1.5% |
당 | 7 | 1.5% |
치 | 6 | 1.3% |
Other values (191) | 365 |
Latin
Value | Count | Frequency (%) |
a | 2 | |
B | 1 | |
t | 1 | |
i | 1 | |
h | 1 | |
r | 1 | |
n | 1 | |
A | 1 | |
e | 1 |
Common
Value | Count | Frequency (%) |
16 | ||
1 | 2 | 8.3% |
8 | 2 | 8.3% |
3 | 1 | 4.2% |
4 | 1 | 4.2% |
7 | 1 | 4.2% |
9 | 1 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 458 | |
ASCII | 34 | 6.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 17 | 3.7% |
점 | 14 | 3.1% |
리 | 12 | 2.6% |
원 | 8 | 1.7% |
마 | 8 | 1.7% |
동 | 7 | 1.5% |
집 | 7 | 1.5% |
산 | 7 | 1.5% |
당 | 7 | 1.5% |
치 | 6 | 1.3% |
Other values (191) | 365 |
ASCII
Value | Count | Frequency (%) |
16 | ||
1 | 2 | 5.9% |
a | 2 | 5.9% |
8 | 2 | 5.9% |
3 | 1 | 2.9% |
4 | 1 | 2.9% |
B | 1 | 2.9% |
t | 1 | 2.9% |
i | 1 | 2.9% |
h | 1 | 2.9% |
Other values (6) | 6 | 17.6% |
load_addr
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 36 |
---|---|
Median length | 30 |
Mean length | 21.16 |
Min length | 16 |
Characters and Unicode
Total characters | 2116 |
---|---|
Distinct characters | 133 |
Distinct categories | 5 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 98 ? |
---|---|
Unique (%) | 98.0% |
Sample
1st row | 서울특별시 강서구 방화대로49길 6-7 |
---|---|
2nd row | 제주특별자치도 서귀포시 중문동 2048-1 |
3rd row | 서울특별시 영등포구 버드나루로길 6 |
4th row | 서울특별시특별시 영등포구 양평로 85 |
5th row | 서울특별시 영등포구 당산로37길 1 |
Value | Count | Frequency (%) |
마포구 | 62 | 14.7% |
서울특별시 | 50 | 11.8% |
서울특별시특별시 | 48 | 11.4% |
영등포구 | 19 | 4.5% |
서대문구 | 11 | 2.6% |
독막로 | 5 | 1.2% |
3 | 5 | 1.2% |
동교로 | 5 | 1.2% |
성미산로 | 4 | 0.9% |
1층 | 4 | 0.9% |
Other values (161) | 209 |
Most occurring characters
Value | Count | Frequency (%) |
323 | ||
시 | 151 | 7.1% |
특 | 149 | 7.0% |
별 | 149 | 7.0% |
서 | 111 | 5.2% |
울 | 99 | 4.7% |
구 | 98 | 4.6% |
로 | 92 | 4.3% |
포 | 86 | 4.1% |
1 | 67 | 3.2% |
Other values (123) | 791 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1456 | |
Space Separator | 323 | 15.3% |
Decimal Number | 317 | 15.0% |
Dash Punctuation | 19 | 0.9% |
Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 151 | 10.4% |
특 | 149 | 10.2% |
별 | 149 | 10.2% |
서 | 111 | 7.6% |
울 | 99 | 6.8% |
구 | 98 | 6.7% |
로 | 92 | 6.3% |
포 | 86 | 5.9% |
마 | 65 | 4.5% |
길 | 54 | 3.7% |
Other values (110) | 402 |
Decimal Number
Value | Count | Frequency (%) |
1 | 67 | |
3 | 45 | |
2 | 44 | |
7 | 33 | |
6 | 28 | |
8 | 22 | 6.9% |
4 | 21 | 6.6% |
0 | 20 | 6.3% |
9 | 20 | 6.3% |
5 | 17 | 5.4% |
Space Separator
Value | Count | Frequency (%) |
323 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19 |
Uppercase Letter
Value | Count | Frequency (%) |
F | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1456 | |
Common | 659 | |
Latin | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 151 | 10.4% |
특 | 149 | 10.2% |
별 | 149 | 10.2% |
서 | 111 | 7.6% |
울 | 99 | 6.8% |
구 | 98 | 6.7% |
로 | 92 | 6.3% |
포 | 86 | 5.9% |
마 | 65 | 4.5% |
길 | 54 | 3.7% |
Other values (110) | 402 |
Common
Value | Count | Frequency (%) |
323 | ||
1 | 67 | 10.2% |
3 | 45 | 6.8% |
2 | 44 | 6.7% |
7 | 33 | 5.0% |
6 | 28 | 4.2% |
8 | 22 | 3.3% |
4 | 21 | 3.2% |
0 | 20 | 3.0% |
9 | 20 | 3.0% |
Other values (2) | 36 | 5.5% |
Latin
Value | Count | Frequency (%) |
F | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1456 | |
ASCII | 660 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
323 | ||
1 | 67 | 10.2% |
3 | 45 | 6.8% |
2 | 44 | 6.7% |
7 | 33 | 5.0% |
6 | 28 | 4.2% |
8 | 22 | 3.3% |
4 | 21 | 3.2% |
0 | 20 | 3.0% |
9 | 20 | 3.0% |
Other values (3) | 37 | 5.6% |
Hangul
Value | Count | Frequency (%) |
시 | 151 | 10.4% |
특 | 149 | 10.2% |
별 | 149 | 10.2% |
서 | 111 | 7.6% |
울 | 99 | 6.8% |
구 | 98 | 6.7% |
로 | 92 | 6.3% |
포 | 86 | 5.9% |
마 | 65 | 4.5% |
길 | 54 | 3.7% |
Other values (110) | 402 |
city_do_cd
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
11 | |
---|---|
<NA> | 3 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.06 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 |
---|---|
2nd row | <NA> |
3rd row | 11 |
4th row | 11 |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
11 | 97 | |
<NA> | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
11 | 97 | |
na | 3 | 3.0% |
city_gn_gu_cd
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
126.9 | |
---|---|
126.8 | 3 |
126.4 | 1 |
126.5 | 1 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 126.8 |
---|---|
2nd row | 126.4 |
3rd row | 126.8 |
4th row | 126.9 |
5th row | 126.8 |
Common Values
Value | Count | Frequency (%) |
126.9 | 95 | |
126.8 | 3 | 3.0% |
126.4 | 1 | 1.0% |
126.5 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
126.9 | 95 | |
126.8 | 3 | 3.0% |
126.4 | 1 | 1.0% |
126.5 | 1 | 1.0% |
xpos_lo
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 53 |
---|---|
Distinct (%) | 53.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.430981 |
Minimum | 33.250304 |
---|---|
Maximum | 37.623491 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 33.250304 |
---|---|
5-th percentile | 37.49121 |
Q1 | 37.548149 |
median | 37.566415 |
Q3 | 37.566415 |
95-th percentile | 37.566537 |
Maximum | 37.623491 |
Range | 4.373187 |
Interquartile range (IQR) | 0.018266 |
Descriptive statistics
Standard deviation | 0.71159549 |
---|---|
Coefficient of variation (CV) | 0.01901087 |
Kurtosis | 29.947246 |
Mean | 37.430981 |
Median Absolute Deviation (MAD) | 0.0018985 |
Skewness | -5.593904 |
Sum | 3743.0981 |
Variance | 0.50636814 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.566415 | 48 | |
37.577455 | 1 | 1.0% |
37.568857 | 1 | 1.0% |
37.550874 | 1 | 1.0% |
37.548923 | 1 | 1.0% |
37.560936 | 1 | 1.0% |
37.565198 | 1 | 1.0% |
37.523886 | 1 | 1.0% |
37.561899 | 1 | 1.0% |
37.560082 | 1 | 1.0% |
Other values (43) | 43 |
Value | Count | Frequency (%) |
33.250304 | 1 | |
33.481967 | 1 | |
33.492212 | 1 | |
37.480517 | 1 | |
37.481229 | 1 | |
37.491735 | 1 | |
37.50306 | 1 | |
37.507114 | 1 | |
37.518617 | 1 | |
37.520238 | 1 |
Value | Count | Frequency (%) |
37.623491 | 1 | 1.0% |
37.577455 | 1 | 1.0% |
37.572506 | 1 | 1.0% |
37.570619 | 1 | 1.0% |
37.568857 | 1 | 1.0% |
37.566415 | 48 | |
37.565198 | 1 | 1.0% |
37.56506 | 1 | 1.0% |
37.562384 | 1 | 1.0% |
37.561899 | 1 | 1.0% |
ypos_la
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 87 |
---|---|
Distinct (%) | 89.7% |
Missing | 3 |
Missing (%) | 3.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.546992 |
Minimum | 37.480517 |
---|---|
Maximum | 37.623491 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 37.480517 |
---|---|
5-th percentile | 37.516316 |
Q1 | 37.539819 |
median | 37.549468 |
Q3 | 37.558122 |
95-th percentile | 37.569944 |
Maximum | 37.623491 |
Range | 0.142974 |
Interquartile range (IQR) | 0.018303 |
Descriptive statistics
Standard deviation | 0.020119045 |
---|---|
Coefficient of variation (CV) | 0.00053583639 |
Kurtosis | 3.420242 |
Mean | 37.546992 |
Median Absolute Deviation (MAD) | 0.009649 |
Skewness | -0.60368551 |
Sum | 3642.0582 |
Variance | 0.00040477596 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.549468 | 2 | 2.0% |
37.560082 | 2 | 2.0% |
37.562384 | 2 | 2.0% |
37.548022 | 2 | 2.0% |
37.520776 | 2 | 2.0% |
37.527602 | 2 | 2.0% |
37.526238 | 2 | 2.0% |
37.561426 | 2 | 2.0% |
37.548923 | 2 | 2.0% |
37.539819 | 2 | 2.0% |
Other values (77) | 77 | |
(Missing) | 3 | 3.0% |
Value | Count | Frequency (%) |
37.480517 | 1 | |
37.481229 | 1 | |
37.491735 | 1 | |
37.50306 | 1 | |
37.507114 | 1 | |
37.518617 | 1 | |
37.519993 | 1 | |
37.520238 | 1 | |
37.520769 | 1 | |
37.520776 | 2 |
Value | Count | Frequency (%) |
37.623491 | 1 | |
37.577455 | 1 | |
37.572506 | 1 | |
37.572489 | 1 | |
37.570619 | 1 | |
37.569775 | 1 | |
37.569449 | 1 | |
37.568857 | 1 | |
37.565198 | 1 | |
37.56506 | 1 |
area_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 | |
---|---|
제주 | 3 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 제주 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
서울 | 97 | |
제주 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 97 | |
제주 | 3 | 3.0% |
homepage_url
Text
MISSING
 
Distinct | 32 |
---|---|
Distinct (%) | 91.4% |
Missing | 65 |
Missing (%) | 65.0% |
Memory size | 932.0 B |
Length
Max length | 46 |
---|---|
Median length | 35 |
Mean length | 32.171429 |
Min length | 14 |
Characters and Unicode
Total characters | 1126 |
---|---|
Distinct characters | 43 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 29 ? |
---|---|
Unique (%) | 82.9% |
Sample
1st row | http://itvplus.co.kr/home6/sam/ |
---|---|
2nd row | http://cityfood.co.kr/h9/sundaeilbeonji |
3rd row | http://www.instagram.com/tteurak_jk |
4th row | http://jinjinseoul.modoo.at/ |
5th row | http://www.richemont.co.kr/ |
Value | Count | Frequency (%) |
http://www.richemont.co.kr | 2 | 5.7% |
http://instagram.com/osteriabaratie | 2 | 5.7% |
http://www.instagram.com/hakatabunko_official | 2 | 5.7% |
http://instagram.com/nanohana.yeonhui | 1 | 2.9% |
https://www.instagram.com/tuktuknoodle | 1 | 2.9% |
https://limpasse81.modoo.at | 1 | 2.9% |
http://coffeelibre.kr | 1 | 2.9% |
http://www.changgo43.co.kr | 1 | 2.9% |
http://www.facebook.com/eeddle/?ref=bookmarks | 1 | 2.9% |
http://blog.naver.com/ksbeom | 1 | 2.9% |
Other values (22) | 22 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 113 | 10.0% |
t | 104 | 9.2% |
o | 85 | 7.5% |
a | 74 | 6.6% |
. | 70 | 6.2% |
h | 51 | 4.5% |
m | 50 | 4.4% |
w | 49 | 4.4% |
c | 47 | 4.2% |
i | 45 | 4.0% |
Other values (33) | 438 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 871 | |
Other Punctuation | 219 | 19.4% |
Decimal Number | 26 | 2.3% |
Connector Punctuation | 5 | 0.4% |
Other Letter | 3 | 0.3% |
Dash Punctuation | 1 | 0.1% |
Math Symbol | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 104 | 11.9% |
o | 85 | 9.8% |
a | 74 | 8.5% |
h | 51 | 5.9% |
m | 50 | 5.7% |
w | 49 | 5.6% |
c | 47 | 5.4% |
i | 45 | 5.2% |
e | 44 | 5.1% |
r | 44 | 5.1% |
Other values (13) | 278 |
Decimal Number
Value | Count | Frequency (%) |
2 | 6 | |
1 | 6 | |
9 | 3 | |
8 | 3 | |
0 | 2 | 7.7% |
3 | 2 | 7.7% |
5 | 1 | 3.8% |
7 | 1 | 3.8% |
6 | 1 | 3.8% |
4 | 1 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 113 | |
. | 70 | |
: | 35 | 16.0% |
? | 1 | 0.5% |
Other Letter
Value | Count | Frequency (%) |
현 | 1 | |
래 | 1 | |
장 | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 5 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 871 | |
Common | 252 | 22.4% |
Hangul | 3 | 0.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 104 | 11.9% |
o | 85 | 9.8% |
a | 74 | 8.5% |
h | 51 | 5.9% |
m | 50 | 5.7% |
w | 49 | 5.6% |
c | 47 | 5.4% |
i | 45 | 5.2% |
e | 44 | 5.1% |
r | 44 | 5.1% |
Other values (13) | 278 |
Common
Value | Count | Frequency (%) |
/ | 113 | |
. | 70 | |
: | 35 | 13.9% |
2 | 6 | 2.4% |
1 | 6 | 2.4% |
_ | 5 | 2.0% |
9 | 3 | 1.2% |
8 | 3 | 1.2% |
0 | 2 | 0.8% |
3 | 2 | 0.8% |
Other values (7) | 7 | 2.8% |
Hangul
Value | Count | Frequency (%) |
현 | 1 | |
래 | 1 | |
장 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1123 | |
Hangul | 3 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 113 | 10.1% |
t | 104 | 9.3% |
o | 85 | 7.6% |
a | 74 | 6.6% |
. | 70 | 6.2% |
h | 51 | 4.5% |
m | 50 | 4.5% |
w | 49 | 4.4% |
c | 47 | 4.2% |
i | 45 | 4.0% |
Other values (30) | 435 |
Hangul
Value | Count | Frequency (%) |
현 | 1 | |
래 | 1 | |
장 | 1 |
tel_no
Text
MISSING
 
Distinct | 90 |
---|---|
Distinct (%) | 92.8% |
Missing | 3 |
Missing (%) | 3.0% |
Memory size | 932.0 B |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 11.484536 |
Min length | 11 |
Characters and Unicode
Total characters | 1114 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 83 ? |
---|---|
Unique (%) | 85.6% |
Sample
1st row | 02-2665-1205 |
---|---|
2nd row | 02-2634-8663 |
3rd row | 02-2634-1359 |
4th row | 02-2068-8791 |
5th row | 02-6083-1393 |
Value | Count | Frequency (%) |
02-338-5536 | 2 | 2.1% |
02-712-7462 | 2 | 2.1% |
02-325-0221 | 2 | 2.1% |
010-6490-2352 | 2 | 2.1% |
02-794-5090 | 2 | 2.1% |
02-335-4764 | 2 | 2.1% |
02-761-9937 | 2 | 2.1% |
02-334-9245 | 1 | 1.0% |
02-336-7656 | 1 | 1.0% |
02-363-5887 | 1 | 1.0% |
Other values (80) | 80 |
Most occurring characters
Value | Count | Frequency (%) |
- | 194 | |
0 | 168 | |
2 | 158 | |
3 | 134 | |
6 | 80 | |
7 | 80 | |
4 | 71 | 6.4% |
1 | 63 | 5.7% |
8 | 62 | 5.6% |
5 | 60 | 5.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 920 | |
Dash Punctuation | 194 | 17.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 168 | |
2 | 158 | |
3 | 134 | |
6 | 80 | |
7 | 80 | |
4 | 71 | |
1 | 63 | 6.8% |
8 | 62 | 6.7% |
5 | 60 | 6.5% |
9 | 44 | 4.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 194 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1114 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 194 | |
0 | 168 | |
2 | 158 | |
3 | 134 | |
6 | 80 | |
7 | 80 | |
4 | 71 | 6.4% |
1 | 63 | 5.7% |
8 | 62 | 5.6% |
5 | 60 | 5.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1114 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 194 | |
0 | 168 | |
2 | 158 | |
3 | 134 | |
6 | 80 | |
7 | 80 | |
4 | 71 | 6.4% |
1 | 63 | 5.7% |
8 | 62 | 5.6% |
5 | 60 | 5.4% |
reprsnt_menu_nm
Text
Distinct | 95 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
쌀국수 | 4 | 3.1% |
삼겹살 | 3 | 2.3% |
양지 | 2 | 1.5% |
아이스크림 | 2 | 1.5% |
돼지갈비 | 2 | 1.5% |
샌드위치 | 2 | 1.5% |
인라멘 | 2 | 1.5% |
아메리카노 | 2 | 1.5% |
유린기 | 1 | 0.8% |
스페셜 | 1 | 0.8% |
Other values (110) | 110 |
Most occurring characters
Value | Count | Frequency (%) |
31 | 5.4% | |
수 | 12 | 2.1% |
국 | 12 | 2.1% |
탕 | 12 | 2.1% |
이 | 11 | 1.9% |
+ | 9 | 1.6% |
살 | 9 | 1.6% |
비 | 8 | 1.4% |
고 | 8 | 1.4% |
삼 | 8 | 1.4% |
Other values (208) | 452 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 504 | |
Space Separator | 31 | 5.4% |
Decimal Number | 13 | 2.3% |
Math Symbol | 10 | 1.7% |
Close Punctuation | 5 | 0.9% |
Open Punctuation | 5 | 0.9% |
Lowercase Letter | 2 | 0.3% |
Uppercase Letter | 2 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
수 | 12 | 2.4% |
국 | 12 | 2.4% |
탕 | 12 | 2.4% |
이 | 11 | 2.2% |
살 | 9 | 1.8% |
비 | 8 | 1.6% |
고 | 8 | 1.6% |
삼 | 8 | 1.6% |
리 | 8 | 1.6% |
트 | 8 | 1.6% |
Other values (194) | 408 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5 | |
4 | 2 | 15.4% |
0 | 2 | 15.4% |
2 | 2 | 15.4% |
3 | 1 | 7.7% |
5 | 1 | 7.7% |
Math Symbol
Value | Count | Frequency (%) |
+ | 9 | |
~ | 1 | 10.0% |
Uppercase Letter
Value | Count | Frequency (%) |
K | 1 | |
O | 1 |
Space Separator
Value | Count | Frequency (%) |
31 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Lowercase Letter
Value | Count | Frequency (%) |
g | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 504 | |
Common | 64 | 11.2% |
Latin | 4 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
수 | 12 | 2.4% |
국 | 12 | 2.4% |
탕 | 12 | 2.4% |
이 | 11 | 2.2% |
살 | 9 | 1.8% |
비 | 8 | 1.6% |
고 | 8 | 1.6% |
삼 | 8 | 1.6% |
리 | 8 | 1.6% |
트 | 8 | 1.6% |
Other values (194) | 408 |
Common
Value | Count | Frequency (%) |
31 | ||
+ | 9 | 14.1% |
1 | 5 | 7.8% |
) | 5 | 7.8% |
( | 5 | 7.8% |
4 | 2 | 3.1% |
0 | 2 | 3.1% |
2 | 2 | 3.1% |
3 | 1 | 1.6% |
~ | 1 | 1.6% |
Latin
Value | Count | Frequency (%) |
g | 2 | |
K | 1 | |
O | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 504 | |
ASCII | 68 | 11.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
31 | ||
+ | 9 | 13.2% |
1 | 5 | 7.4% |
) | 5 | 7.4% |
( | 5 | 7.4% |
4 | 2 | 2.9% |
g | 2 | 2.9% |
0 | 2 | 2.9% |
2 | 2 | 2.9% |
3 | 1 | 1.5% |
Other values (4) | 4 | 5.9% |
Hangul
Value | Count | Frequency (%) |
수 | 12 | 2.4% |
국 | 12 | 2.4% |
탕 | 12 | 2.4% |
이 | 11 | 2.2% |
살 | 9 | 1.8% |
비 | 8 | 1.6% |
고 | 8 | 1.6% |
삼 | 8 | 1.6% |
리 | 8 | 1.6% |
트 | 8 | 1.6% |
Other values (194) | 408 |
menu_pc
Categorical
HIGH CORRELATION
 
Distinct | 49 |
---|---|
Distinct (%) | 49.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
8 000 | |
---|---|
9 000 | |
13 000 | 6 |
5 000 | 6 |
7 000 | 4 |
Other values (44) |
Length
Max length | 12 |
---|---|
Median length | 6 |
Mean length | 5.53 |
Min length | 2 |
Unique
Unique | 29 ? |
---|---|
Unique (%) | 29.0% |
Sample
1st row | 7 000 |
---|---|
2nd row | <NA> |
3rd row | 9 000 |
4th row | 20 000 |
5th row | 8 000 |
Common Values
Value | Count | Frequency (%) |
8 000 | 9 | 9.0% |
9 000 | 8 | 8.0% |
13 000 | 6 | 6.0% |
5 000 | 6 | 6.0% |
7 000 | 4 | 4.0% |
15 000 | 4 | 4.0% |
4 000 | 4 | 4.0% |
14 000 | 3 | 3.0% |
12 000 | 3 | 3.0% |
<NA> | 3 | 3.0% |
Other values (39) | 50 |
Length
Value | Count | Frequency (%) |
000 | 80 | |
8 | 12 | 6.1% |
9 | 10 | 5.1% |
500 | 10 | 5.1% |
5 | 8 | 4.1% |
13 | 6 | 3.0% |
7 | 6 | 3.0% |
15 | 4 | 2.0% |
4 | 4 | 2.0% |
30 | 3 | 1.5% |
Other values (35) | 54 |
base_ymd
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2020-12-31 00:00:00 |
---|---|
Maximum | 2020-12-31 00:00:00 |
entrp_nm | load_addr | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | tel_no | reprsnt_menu_nm | menu_pc | |
---|---|---|---|---|---|---|---|---|---|---|
entrp_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
load_addr | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.996 | 0.997 | 0.978 |
city_gn_gu_cd | 1.000 | 1.000 | 1.000 | 0.954 | 0.366 | 0.954 | NaN | 1.000 | 1.000 | 0.000 |
xpos_lo | 1.000 | 1.000 | 0.954 | 1.000 | NaN | 0.963 | NaN | NaN | 1.000 | NaN |
ypos_la | 1.000 | 1.000 | 0.366 | NaN | 1.000 | NaN | 1.000 | 1.000 | 0.523 | 0.000 |
area_nm | 1.000 | 1.000 | 0.954 | 0.963 | NaN | 1.000 | NaN | NaN | 1.000 | NaN |
homepage_url | 1.000 | 1.000 | NaN | NaN | 1.000 | NaN | 1.000 | 1.000 | 0.989 | 0.741 |
tel_no | 1.000 | 0.996 | 1.000 | NaN | 1.000 | NaN | 1.000 | 1.000 | 0.989 | 0.000 |
reprsnt_menu_nm | 1.000 | 0.997 | 1.000 | 1.000 | 0.523 | 1.000 | 0.989 | 0.989 | 1.000 | 0.993 |
menu_pc | 1.000 | 0.978 | 0.000 | NaN | 0.000 | NaN | 0.741 | 0.000 | 0.993 | 1.000 |
city_gn_gu_cd | menu_pc | city_do_cd | area_nm | |
---|---|---|---|---|
city_gn_gu_cd | 1.000 | 0.000 | 1.000 | 0.798 |
menu_pc | 0.000 | 1.000 | 1.000 | 1.000 |
city_do_cd | 1.000 | 1.000 | 1.000 | 1.000 |
area_nm | 0.798 | 1.000 | 1.000 | 1.000 |
xpos_lo | ypos_la | city_do_cd | city_gn_gu_cd | area_nm | menu_pc | |
---|---|---|---|---|---|---|
xpos_lo | 1.000 | 0.516 | 1.000 | 0.798 | 0.826 | 1.000 |
ypos_la | 0.516 | 1.000 | 1.000 | 0.265 | 1.000 | 0.000 |
city_do_cd | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
city_gn_gu_cd | 0.798 | 0.265 | 1.000 | 1.000 | 0.798 | 0.000 |
area_nm | 0.826 | 1.000 | 1.000 | 0.798 | 1.000 | 1.000 |
menu_pc | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | tel_no | reprsnt_menu_nm | menu_pc | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 고성막국수 | 서울특별시 강서구 방화대로49길 6-7 | 11 | 126.8 | 37.577455 | 37.577455 | 서울 | <NA> | 02-2665-1205 | 물막국수 | 7 000 | 2020-12-31 |
1 | 제주한라국수 | 제주특별자치도 서귀포시 중문동 2048-1 | <NA> | 126.4 | 33.250304 | <NA> | 제주 | <NA> | <NA> | 고기국수 | <NA> | 2020-12-31 |
2 | 덕원 | 서울특별시 영등포구 버드나루로길 6 | 11 | 126.8 | 37.526413 | 37.526413 | 서울 | <NA> | 02-2634-8663 | 꼬리곰탕 | 9 000 | 2020-12-31 |
3 | 길풍식당 | 서울특별시특별시 영등포구 양평로 85 | 11 | 126.9 | 37.566415 | 37.53592 | 서울 | <NA> | 02-2634-1359 | 꼬리탕 | 20 000 | 2020-12-31 |
4 | 대관원 | 서울특별시 영등포구 당산로37길 1 | 11 | 126.8 | 37.52942 | 37.52942 | 서울 | <NA> | 02-2068-8791 | 삼선간짜장 | 8 000 | 2020-12-31 |
5 | 당산마루 능이버섯삼계탕 | 서울특별시특별시 영등포구 선유로54길 9 | 11 | 126.9 | 37.566415 | 37.536159 | 서울 | <NA> | 02-6083-1393 | 능이버섯삼계탕 | 14 000 | 2020-12-31 |
6 | 원조호수삼계탕 | 서울특별시 영등포구 도림로 282 | 11 | 126.9 | 37.50306 | 37.50306 | 서울 | <NA> | 02-833-8948 | 삼계탕 | 13 000 | 2020-12-31 |
7 | 해월정 | 제주특별자치도 제주시 구좌읍 종달리 608 | <NA> | 126.9 | 33.492212 | <NA> | 제주 | <NA> | <NA> | 보말칼국수 | <NA> | 2020-12-31 |
8 | 동일루 | 서울특별시 마포구 포은로 75 | 11 | 126.9 | 37.553828 | 37.553828 | 서울 | <NA> | 02-3144-2221 | 찹쌀탕수육 소 | 15 000 | 2020-12-31 |
9 | 프롬하노이 | 서울특별시특별시 마포구 포은로8길 20 | 11 | 126.9 | 37.566415 | 37.556426 | 서울 | <NA> | 02-337-0301 | 퍼보 | 10 000 | 2020-12-31 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | tel_no | reprsnt_menu_nm | menu_pc | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 마포원조주물럭 | 서울특별시특별시 마포구 토정로 294 | 11 | 126.9 | 37.566415 | 37.540833 | 서울 | <NA> | 02-716-3001 | 주물럭 | 45 000 | 2020-12-31 |
91 | 마포옥 | 서울특별시특별시 마포구 토정로 312 | 11 | 126.9 | 37.566415 | 37.539882 | 서울 | <NA> | 02-716-6661 | 양지 설렁탕 | 14 000 | 2020-12-31 |
92 | 참식당 | 서울특별시 마포구 용강동 43-2 | 11 | 126.9 | 37.540172 | 37.540172 | 서울 | <NA> | 02-706-2432 | 생대구탕 1인 | 20 000 | 2020-12-31 |
93 | 원조조박집 | 서울특별시 마포구 토정로37길 3 | 11 | 126.9 | 37.539819 | 37.539819 | 서울 | <NA> | 02-712-7462 | 돼지갈비 | 14 000 | 2020-12-31 |
94 | 조박집 본관 | 서울특별시특별시 마포구 토정로37길 3 | 11 | 126.9 | 37.566415 | 37.539819 | 서울 | <NA> | 02-712-7462 | 국내산 돼지갈비 | 15 000 | 2020-12-31 |
95 | 현래장 | 서울특별시 마포구 마포대로 20 | 11 | 126.9 | 37.538428 | 37.538428 | 서울 | http://현래장.com | 02-712-0730 | 손옛날짜장 | 5 000 | 2020-12-31 |
96 | 남도포장마차 | 서울특별시 관악구 청룡2길 3 | 11 | 126.9 | 37.481229 | 37.481229 | 서울 | <NA> | 02-871-9121 | 꽃게+수제비탕 | 시가 | 2020-12-31 |
97 | 논밭골 왕갈비탕 | 서울특별시 관악구 청룡길 30 | 11 | 126.9 | 37.480517 | 37.480517 | 서울 | <NA> | 02-875-6493 | 왕갈비탕 | 9 000 | 2020-12-31 |
98 | 도하정 | 서울특별시특별시 마포구 마포대로4길 38 1층 도하정 | 11 | 126.9 | 37.566415 | 37.538037 | 서울 | https://www.instagram.com/dohajung2018/ | 010-9440-6639 | 소고기 듬뿍 곰탕 | 9 000 | 2020-12-31 |
99 | 락희옥 | 서울특별시 마포구 백범로 170 | 11 | 126.9 | 37.544897 | 37.544897 | 서울 | https://lakhee1.blog.me/ | 02-719-9797 | 보쌈 | 30 000 | 2020-12-31 |