Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 100 |
Missing cells | 25 |
Missing cells (%) | 1.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.6 KiB |
Average record size in memory | 118.3 B |
Variable types
Text | 4 |
---|---|
Numeric | 4 |
Categorical | 6 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=a8257705-c99d-45e4-b3bc-f54660c8c141 |
base_ymd has constant value "" | Constant |
trrsrt3_nm is highly overall correlated with city_do_cd and 6 other fields | High correlation |
trrsrt2_nm is highly overall correlated with city_do_cd and 6 other fields | High correlation |
trrsrt1_nm is highly overall correlated with city_do_cd and 6 other fields | High correlation |
city_do_cd is highly overall correlated with city_gn_gu_cd and 5 other fields | High correlation |
city_gn_gu_cd is highly overall correlated with city_do_cd and 5 other fields | High correlation |
xpos_lo is highly overall correlated with area_nm and 3 other fields | High correlation |
ypos_la is highly overall correlated with city_do_cd and 5 other fields | High correlation |
area_nm is highly overall correlated with city_do_cd and 6 other fields | High correlation |
tel_no has 6 (6.0%) missing values | Missing |
homepage_url has 19 (19.0%) missing values | Missing |
entrp_nm has unique values | Unique |
load_addr has unique values | Unique |
xpos_lo has unique values | Unique |
ypos_la has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:20:15.229593 |
---|---|
Analysis finished | 2023-12-10 10:20:20.333201 |
Duration | 5.1 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
entrp_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
호텔 | 64 | |
프리미어 | 7 | 3.1% |
서울 | 6 | 2.6% |
강남 | 5 | 2.2% |
더 | 5 | 2.2% |
디자이너스 | 5 | 2.2% |
컬리넌 | 5 | 2.2% |
베스트웨스턴 | 3 | 1.3% |
종로 | 3 | 1.3% |
관광 | 3 | 1.3% |
Other values (109) | 121 |
Most occurring characters
Value | Count | Frequency (%) |
127 | 14.4% | |
호 | 94 | 10.6% |
텔 | 88 | 9.9% |
스 | 42 | 4.7% |
이 | 24 | 2.7% |
리 | 21 | 2.4% |
트 | 15 | 1.7% |
아 | 15 | 1.7% |
서 | 14 | 1.6% |
동 | 13 | 1.5% |
Other values (157) | 432 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 744 | |
Space Separator | 127 | 14.4% |
Decimal Number | 7 | 0.8% |
Uppercase Letter | 7 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 94 | 12.6% |
텔 | 88 | 11.8% |
스 | 42 | 5.6% |
이 | 24 | 3.2% |
리 | 21 | 2.8% |
트 | 15 | 2.0% |
아 | 15 | 2.0% |
서 | 14 | 1.9% |
동 | 13 | 1.7% |
어 | 12 | 1.6% |
Other values (147) | 406 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 2 | |
J | 1 | |
E | 1 | |
T | 1 | |
I | 1 | |
M | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
2 | 3 | |
7 | 1 | 14.3% |
Space Separator
Value | Count | Frequency (%) |
127 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 744 | |
Common | 134 | 15.1% |
Latin | 7 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 94 | 12.6% |
텔 | 88 | 11.8% |
스 | 42 | 5.6% |
이 | 24 | 3.2% |
리 | 21 | 2.8% |
트 | 15 | 2.0% |
아 | 15 | 2.0% |
서 | 14 | 1.9% |
동 | 13 | 1.7% |
어 | 12 | 1.6% |
Other values (147) | 406 |
Latin
Value | Count | Frequency (%) |
S | 2 | |
J | 1 | |
E | 1 | |
T | 1 | |
I | 1 | |
M | 1 |
Common
Value | Count | Frequency (%) |
127 | ||
1 | 3 | 2.2% |
2 | 3 | 2.2% |
7 | 1 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 744 | |
ASCII | 141 | 15.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
127 | ||
1 | 3 | 2.1% |
2 | 3 | 2.1% |
S | 2 | 1.4% |
J | 1 | 0.7% |
E | 1 | 0.7% |
T | 1 | 0.7% |
7 | 1 | 0.7% |
I | 1 | 0.7% |
M | 1 | 0.7% |
Hangul
Value | Count | Frequency (%) |
호 | 94 | 12.6% |
텔 | 88 | 11.8% |
스 | 42 | 5.6% |
이 | 24 | 3.2% |
리 | 21 | 2.8% |
트 | 15 | 2.0% |
아 | 15 | 2.0% |
서 | 14 | 1.9% |
동 | 13 | 1.7% |
어 | 12 | 1.6% |
Other values (147) | 406 |
load_addr
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 46 |
---|---|
Median length | 29.5 |
Mean length | 20.91 |
Min length | 15 |
Characters and Unicode
Total characters | 2091 |
---|---|
Distinct characters | 197 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 경기도 구리시 안골로57번길 10-6 뉴호스텔모텔 |
---|---|
2nd row | 제주특별자치도 제주시 삼무로 48 |
3rd row | 경기도 남양주시 별내2로 70 호텔 더 메이 |
4th row | 경기도 성남시 분당구 황새울로311번길 36 |
5th row | 경기도 수원시 권선구 권선로 669번길 26 |
Value | Count | Frequency (%) |
서울특별시 | 68 | 15.3% |
경기도 | 17 | 3.8% |
강남구 | 14 | 3.1% |
종로구 | 11 | 2.5% |
부산광역시 | 10 | 2.2% |
중구 | 9 | 2.0% |
강서구 | 7 | 1.6% |
영등포구 | 7 | 1.6% |
수원시 | 5 | 1.1% |
서초구 | 5 | 1.1% |
Other values (246) | 292 |
Most occurring characters
Value | Count | Frequency (%) |
345 | 16.5% | |
시 | 103 | 4.9% |
로 | 102 | 4.9% |
구 | 93 | 4.4% |
서 | 83 | 4.0% |
별 | 73 | 3.5% |
특 | 72 | 3.4% |
울 | 70 | 3.3% |
1 | 69 | 3.3% |
길 | 49 | 2.3% |
Other values (187) | 1032 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1377 | |
Space Separator | 345 | 16.5% |
Decimal Number | 336 | 16.1% |
Dash Punctuation | 19 | 0.9% |
Uppercase Letter | 8 | 0.4% |
Lowercase Letter | 4 | 0.2% |
Open Punctuation | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 103 | 7.5% |
로 | 102 | 7.4% |
구 | 93 | 6.8% |
서 | 83 | 6.0% |
별 | 73 | 5.3% |
특 | 72 | 5.2% |
울 | 70 | 5.1% |
길 | 49 | 3.6% |
강 | 26 | 1.9% |
대 | 26 | 1.9% |
Other values (164) | 680 |
Decimal Number
Value | Count | Frequency (%) |
1 | 69 | |
2 | 46 | |
3 | 30 | |
7 | 30 | |
8 | 30 | |
5 | 29 | |
9 | 28 | |
6 | 28 | |
4 | 26 | 7.7% |
0 | 20 | 6.0% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 2 | |
A | 2 | |
E | 2 | |
H | 1 | |
P | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 1 | |
t | 1 | |
o | 1 | |
e | 1 |
Space Separator
Value | Count | Frequency (%) |
345 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1377 | |
Common | 702 | |
Latin | 12 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 103 | 7.5% |
로 | 102 | 7.4% |
구 | 93 | 6.8% |
서 | 83 | 6.0% |
별 | 73 | 5.3% |
특 | 72 | 5.2% |
울 | 70 | 5.1% |
길 | 49 | 3.6% |
강 | 26 | 1.9% |
대 | 26 | 1.9% |
Other values (164) | 680 |
Common
Value | Count | Frequency (%) |
345 | ||
1 | 69 | 9.8% |
2 | 46 | 6.6% |
3 | 30 | 4.3% |
7 | 30 | 4.3% |
8 | 30 | 4.3% |
5 | 29 | 4.1% |
9 | 28 | 4.0% |
6 | 28 | 4.0% |
4 | 26 | 3.7% |
Other values (4) | 41 | 5.8% |
Latin
Value | Count | Frequency (%) |
C | 2 | |
A | 2 | |
E | 2 | |
l | 1 | |
t | 1 | |
o | 1 | |
H | 1 | |
P | 1 | |
e | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1377 | |
ASCII | 714 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
345 | ||
1 | 69 | 9.7% |
2 | 46 | 6.4% |
3 | 30 | 4.2% |
7 | 30 | 4.2% |
8 | 30 | 4.2% |
5 | 29 | 4.1% |
9 | 28 | 3.9% |
6 | 28 | 3.9% |
4 | 26 | 3.6% |
Other values (13) | 53 | 7.4% |
Hangul
Value | Count | Frequency (%) |
시 | 103 | 7.5% |
로 | 102 | 7.4% |
구 | 93 | 6.8% |
서 | 83 | 6.0% |
별 | 73 | 5.3% |
특 | 72 | 5.2% |
울 | 70 | 5.1% |
길 | 49 | 3.6% |
강 | 26 | 1.9% |
대 | 26 | 1.9% |
Other values (164) | 680 |
city_do_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.5 |
Minimum | 11 |
---|---|
Maximum | 50 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 11 |
median | 11 |
Q3 | 26 |
95-th percentile | 41.3 |
Maximum | 50 |
Range | 39 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 13.315518 |
---|---|
Coefficient of variation (CV) | 0.6828471 |
Kurtosis | -0.4189417 |
Mean | 19.5 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.1307526 |
Sum | 1950 |
Variance | 177.30303 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 68 | |
41 | 17 | 17.0% |
26 | 10 | 10.0% |
50 | 3 | 3.0% |
47 | 1 | 1.0% |
48 | 1 | 1.0% |
Value | Count | Frequency (%) |
11 | 68 | |
26 | 10 | 10.0% |
41 | 17 | 17.0% |
47 | 1 | 1.0% |
48 | 1 | 1.0% |
50 | 3 | 3.0% |
Value | Count | Frequency (%) |
50 | 3 | 3.0% |
48 | 1 | 1.0% |
47 | 1 | 1.0% |
41 | 17 | 17.0% |
26 | 10 | 10.0% |
11 | 68 |
city_gn_gu_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 40.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19866.57 |
Minimum | 11110 |
---|---|
Maximum | 50110 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11110 |
Q1 | 11275 |
median | 11650 |
Q3 | 26350 |
95-th percentile | 41866.15 |
Maximum | 50110 |
Range | 39000 |
Interquartile range (IQR) | 15075 |
Descriptive statistics
Standard deviation | 13236.914 |
---|---|
Coefficient of variation (CV) | 0.66629085 |
Kurtosis | -0.42054612 |
Mean | 19866.57 |
Median Absolute Deviation (MAD) | 510 |
Skewness | 1.1304884 |
Sum | 1986657 |
Variance | 1.7521588 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11680 | 14 | 14.0% |
11110 | 11 | 11.0% |
11140 | 8 | 8.0% |
11500 | 7 | 7.0% |
11560 | 7 | 7.0% |
11650 | 5 | 5.0% |
26350 | 4 | 4.0% |
41115 | 3 | 3.0% |
50110 | 3 | 3.0% |
11170 | 3 | 3.0% |
Other values (30) | 35 |
Value | Count | Frequency (%) |
11110 | 11 | |
11140 | 8 | |
11170 | 3 | 3.0% |
11215 | 2 | 2.0% |
11230 | 1 | 1.0% |
11290 | 1 | 1.0% |
11305 | 1 | 1.0% |
11350 | 1 | 1.0% |
11380 | 1 | 1.0% |
11440 | 1 | 1.0% |
Value | Count | Frequency (%) |
50110 | 3 | |
48310 | 1 | 1.0% |
47113 | 1 | 1.0% |
41590 | 1 | 1.0% |
41500 | 1 | 1.0% |
41480 | 1 | 1.0% |
41390 | 1 | 1.0% |
41370 | 1 | 1.0% |
41360 | 1 | 1.0% |
41310 | 1 | 1.0% |
xpos_lo
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.2234 |
Minimum | 126.49176 |
---|---|
Maximum | 129.35736 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 126.49176 |
---|---|
5-th percentile | 126.79936 |
Q1 | 126.93255 |
median | 127.01257 |
Q3 | 127.04801 |
95-th percentile | 129.11991 |
Maximum | 129.35736 |
Range | 2.8656047 |
Interquartile range (IQR) | 0.11545882 |
Descriptive statistics
Standard deviation | 0.70126902 |
---|---|
Coefficient of variation (CV) | 0.0055121074 |
Kurtosis | 3.5519786 |
Mean | 127.2234 |
Median Absolute Deviation (MAD) | 0.0516874 |
Skewness | 2.2619651 |
Sum | 12722.34 |
Variance | 0.49177824 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.1374986 | 1 | 1.0% |
127.0204833 | 1 | 1.0% |
126.9216068 | 1 | 1.0% |
126.8904958 | 1 | 1.0% |
126.8987369 | 1 | 1.0% |
126.9205137 | 1 | 1.0% |
126.920129 | 1 | 1.0% |
127.0975344 | 1 | 1.0% |
127.0960855 | 1 | 1.0% |
127.0166679 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
126.491759 | 1 | |
126.527765 | 1 | |
126.547803 | 1 | |
126.7402653 | 1 | |
126.7618722 | 1 | |
126.8013283 | 1 | |
126.8142917 | 1 | |
126.8184735 | 1 | |
126.8267215 | 1 | |
126.8357949 | 1 |
Value | Count | Frequency (%) |
129.3573637 | 1 | |
129.165879 | 1 | |
129.1611261 | 1 | |
129.1569244 | 1 | |
129.1565486 | 1 | |
129.1179799 | 1 | |
129.058122 | 1 | |
129.057318 | 1 | |
129.0418332 | 1 | |
129.0373247 | 1 |
ypos_la
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.106472 |
Minimum | 33.445747 |
---|---|
Maximum | 37.715623 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 33.445747 |
---|---|
5-th percentile | 35.100873 |
Q1 | 37.388768 |
median | 37.511294 |
Q3 | 37.564216 |
95-th percentile | 37.597759 |
Maximum | 37.715623 |
Range | 4.2698758 |
Interquartile range (IQR) | 0.17544798 |
Descriptive statistics
Standard deviation | 0.99708362 |
---|---|
Coefficient of variation (CV) | 0.026870882 |
Kurtosis | 4.4468765 |
Mean | 37.106472 |
Median Absolute Deviation (MAD) | 0.05651452 |
Skewness | -2.333788 |
Sum | 3710.6472 |
Variance | 0.99417575 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.59764692 | 1 | 1.0% |
37.49616883 | 1 | 1.0% |
37.51385233 | 1 | 1.0% |
37.54087607 | 1 | 1.0% |
37.53464944 | 1 | 1.0% |
37.53036423 | 1 | 1.0% |
37.52824985 | 1 | 1.0% |
37.50601088 | 1 | 1.0% |
37.50219886 | 1 | 1.0% |
37.59361073 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
33.445747 | 1 | |
33.489376 | 1 | |
33.517558 | 1 | |
34.88593306 | 1 | |
35.09382162 | 1 | |
35.10124405 | 1 | |
35.11615674 | 1 | |
35.151201 | 1 | |
35.15342586 | 1 | |
35.154691 | 1 |
Value | Count | Frequency (%) |
37.7156228 | 1 | |
37.65519408 | 1 | |
37.6466594 | 1 | |
37.61088514 | 1 | |
37.59989609 | 1 | |
37.59764692 | 1 | |
37.59361073 | 1 | |
37.58171189 | 1 | |
37.57625021 | 1 | |
37.57446681 | 1 |
area_nm
Categorical
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 | |
---|---|
경기 | |
부산 | |
제주 | 3 |
경북 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 경기 |
---|---|
2nd row | 제주 |
3rd row | 경기 |
4th row | 경기 |
5th row | 경기 |
Common Values
Value | Count | Frequency (%) |
서울 | 68 | |
경기 | 17 | 17.0% |
부산 | 10 | 10.0% |
제주 | 3 | 3.0% |
경북 | 1 | 1.0% |
경남 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 68 | |
경기 | 17 | 17.0% |
부산 | 10 | 10.0% |
제주 | 3 | 3.0% |
경북 | 1 | 1.0% |
경남 | 1 | 1.0% |
hotel_grad
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
3 | |
---|---|
2 | |
4 | |
1 | |
<NA> |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.21 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 3 |
3rd row | 3 |
4th row | 3 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
3 | 48 | |
2 | 20 | |
4 | 15 | 15.0% |
1 | 9 | 9.0% |
<NA> | 7 | 7.0% |
5 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3 | 48 | |
2 | 20 | |
4 | 15 | 15.0% |
1 | 9 | 9.0% |
na | 7 | 7.0% |
5 | 1 | 1.0% |
tel_no
Text
MISSING
 
Distinct | 94 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 6.0% |
Memory size | 932.0 B |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 11.638298 |
Min length | 9 |
Characters and Unicode
Total characters | 1094 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 94 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 064-797-0000 |
---|---|
2nd row | 031-551-8700 |
3rd row | 1877-8006 |
4th row | 031-230-5000 |
5th row | 064-795-7000 |
Value | Count | Frequency (%) |
051-805-9901 | 1 | 1.1% |
02-538-5177 | 1 | 1.1% |
02-2014-1111 | 1 | 1.1% |
02-2671-9995 | 1 | 1.1% |
02-783-2233 | 1 | 1.1% |
02-786-5511 | 1 | 1.1% |
02-2143-3000 | 1 | 1.1% |
02-425-1000 | 1 | 1.1% |
02-925-7000 | 1 | 1.1% |
02-3474-3399 | 1 | 1.1% |
Other values (84) | 84 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 261 | |
- | 187 | |
2 | 146 | |
5 | 80 | 7.3% |
1 | 79 | 7.2% |
3 | 70 | 6.4% |
7 | 66 | 6.0% |
6 | 59 | 5.4% |
8 | 54 | 4.9% |
9 | 47 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 907 | |
Dash Punctuation | 187 | 17.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 261 | |
2 | 146 | |
5 | 80 | 8.8% |
1 | 79 | 8.7% |
3 | 70 | 7.7% |
7 | 66 | 7.3% |
6 | 59 | 6.5% |
8 | 54 | 6.0% |
9 | 47 | 5.2% |
4 | 45 | 5.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 187 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1094 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 261 | |
- | 187 | |
2 | 146 | |
5 | 80 | 7.3% |
1 | 79 | 7.2% |
3 | 70 | 6.4% |
7 | 66 | 6.0% |
6 | 59 | 5.4% |
8 | 54 | 4.9% |
9 | 47 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1094 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 261 | |
- | 187 | |
2 | 146 | |
5 | 80 | 7.3% |
1 | 79 | 7.2% |
3 | 70 | 6.4% |
7 | 66 | 6.0% |
6 | 59 | 5.4% |
8 | 54 | 4.9% |
9 | 47 | 4.3% |
homepage_url
Text
MISSING
 
Distinct | 77 |
---|---|
Distinct (%) | 95.1% |
Missing | 19 |
Missing (%) | 19.0% |
Memory size | 932.0 B |
Length
Max length | 76 |
---|---|
Median length | 42 |
Mean length | 32.864198 |
Min length | 19 |
Characters and Unicode
Total characters | 2662 |
---|---|
Distinct characters | 44 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 76 ? |
---|---|
Unique (%) | 93.8% |
Sample
1st row | https://www.skyparkhotel.com/html/main.asp |
---|---|
2nd row | http://www.jshotelbundang.com/ |
3rd row | https://www.ambatel.com/ibis/suwon/ko/main.do |
4th row | http://www.whistlelark.co.kr/ |
5th row | https://hoteltate.modoo.at/ |
Value | Count | Frequency (%) |
https://www.skyparkhotel.com/html/main.asp | 5 | 6.2% |
http://hotelthedesigners.kr | 1 | 1.2% |
https://www.mayfield.co.kr/2017/kor/html/index/index.asp | 1 | 1.2% |
http://www.cshotelseoul.com | 1 | 1.2% |
http://www.hotel-loft.co.kr | 1 | 1.2% |
http://www.hotelbenhur.co.kr | 1 | 1.2% |
http://hotelthedesigners.com | 1 | 1.2% |
http://www.rosanahotel.co.kr | 1 | 1.2% |
http://www.hotelinfini.co.kr/default | 1 | 1.2% |
https://www.hotelahill.com | 1 | 1.2% |
Other values (67) | 67 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 275 | 10.3% |
t | 273 | 10.3% |
w | 197 | 7.4% |
o | 191 | 7.2% |
. | 190 | 7.1% |
h | 178 | 6.7% |
e | 157 | 5.9% |
p | 116 | 4.4% |
a | 110 | 4.1% |
l | 105 | 3.9% |
Other values (34) | 870 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2071 | |
Other Punctuation | 555 | 20.8% |
Decimal Number | 23 | 0.9% |
Uppercase Letter | 6 | 0.2% |
Dash Punctuation | 4 | 0.2% |
Math Symbol | 2 | 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 273 | |
w | 197 | 9.5% |
o | 191 | 9.2% |
h | 178 | 8.6% |
e | 157 | 7.6% |
p | 116 | 5.6% |
a | 110 | 5.3% |
l | 105 | 5.1% |
m | 101 | 4.9% |
c | 95 | 4.6% |
Other values (15) | 548 |
Decimal Number
Value | Count | Frequency (%) |
2 | 6 | |
1 | 4 | |
0 | 4 | |
3 | 3 | |
5 | 3 | |
6 | 2 | 8.7% |
7 | 1 | 4.3% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 275 | |
. | 190 | |
: | 83 | 15.0% |
% | 4 | 0.7% |
? | 2 | 0.4% |
# | 1 | 0.2% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 3 | |
R | 2 | |
A | 1 | 16.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Math Symbol
Value | Count | Frequency (%) |
= | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2077 | |
Common | 585 | 22.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 273 | |
w | 197 | 9.5% |
o | 191 | 9.2% |
h | 178 | 8.6% |
e | 157 | 7.6% |
p | 116 | 5.6% |
a | 110 | 5.3% |
l | 105 | 5.1% |
m | 101 | 4.9% |
c | 95 | 4.6% |
Other values (18) | 554 |
Common
Value | Count | Frequency (%) |
/ | 275 | |
. | 190 | |
: | 83 | 14.2% |
2 | 6 | 1.0% |
- | 4 | 0.7% |
1 | 4 | 0.7% |
% | 4 | 0.7% |
0 | 4 | 0.7% |
3 | 3 | 0.5% |
5 | 3 | 0.5% |
Other values (6) | 9 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2662 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 275 | 10.3% |
t | 273 | 10.3% |
w | 197 | 7.4% |
o | 191 | 7.2% |
. | 190 | 7.1% |
h | 178 | 6.7% |
e | 157 | 5.9% |
p | 116 | 4.4% |
a | 110 | 4.1% |
l | 105 | 3.9% |
Other values (34) | 870 |
trrsrt1_nm
Categorical
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
명동 거리 | |
---|---|
코엑스 | |
경복궁 | |
수원화성 | |
예술의전당 | |
Other values (25) |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 4.51 |
Min length | 3 |
Unique
Unique | 18 ? |
---|---|
Unique (%) | 18.0% |
Sample
1st row | 제부도 |
---|---|
2nd row | 함덕해수욕장 |
3rd row | 물향기수목원 |
4th row | 바람의언덕 |
5th row | 안양예술공원 |
Common Values
Value | Count | Frequency (%) |
명동 거리 | 20 | |
코엑스 | 16 | |
경복궁 | 9 | 9.0% |
수원화성 | 6 | 6.0% |
예술의전당 | 5 | 5.0% |
서울식물원 | 5 | 5.0% |
여의도 공원 | 5 | 5.0% |
동백섬 | 4 | 4.0% |
안양예술공원 | 4 | 4.0% |
함덕해수욕장 | 3 | 3.0% |
Other values (20) | 23 |
Length
Value | Count | Frequency (%) |
거리 | 23 | |
명동 | 20 | |
코엑스 | 16 | |
경복궁 | 9 | 7.0% |
수원화성 | 6 | 4.7% |
예술의전당 | 5 | 3.9% |
서울식물원 | 5 | 3.9% |
여의도 | 5 | 3.9% |
공원 | 5 | 3.9% |
동백섬 | 4 | 3.1% |
Other values (23) | 31 |
trrsrt2_nm
Categorical
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
N서울타워 | |
---|---|
가로수길 | |
인사동 | |
화성행궁 | |
세빛섬 | |
Other values (25) |
Length
Max length | 13 |
---|---|
Median length | 8.5 |
Mean length | 4.88 |
Min length | 2 |
Unique
Unique | 18 ? |
---|---|
Unique (%) | 18.0% |
Sample
1st row | 궁평항 |
---|---|
2nd row | 우도 |
3rd row | 유엔군 초전기념관 |
4th row | 외도 |
5th row | 안양중앙공원 |
Common Values
Value | Count | Frequency (%) |
N서울타워 | 20 | |
가로수길 | 16 | |
인사동 | 9 | 9.0% |
화성행궁 | 6 | 6.0% |
세빛섬 | 5 | 5.0% |
우장산 | 5 | 5.0% |
63빌딩 | 5 | 5.0% |
SEALIFE 아쿠아리움 | 4 | 4.0% |
안양중앙공원 | 4 | 4.0% |
우도 | 3 | 3.0% |
Other values (20) | 23 |
Length
Value | Count | Frequency (%) |
n서울타워 | 20 | |
가로수길 | 16 | |
인사동 | 9 | 8.4% |
화성행궁 | 6 | 5.6% |
세빛섬 | 5 | 4.7% |
우장산 | 5 | 4.7% |
63빌딩 | 5 | 4.7% |
sealife | 4 | 3.7% |
아쿠아리움 | 4 | 3.7% |
안양중앙공원 | 4 | 3.7% |
Other values (24) | 29 |
trrsrt3_nm
Categorical
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
남산공원 | |
---|---|
강남역거리 | |
북촌 한옥마을 | |
광교호수공원 | |
반포한강공원 | |
Other values (25) |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 5.46 |
Min length | 3 |
Unique
Unique | 18 ? |
---|---|
Unique (%) | 18.0% |
Sample
1st row | 전곡항 |
---|---|
2nd row | 만장굴 |
3rd row | 독산성 세마대지 |
4th row | 거제포로수용소유적공원 |
5th row | 안양일번가 |
Common Values
Value | Count | Frequency (%) |
남산공원 | 20 | |
강남역거리 | 16 | |
북촌 한옥마을 | 9 | 9.0% |
광교호수공원 | 6 | 6.0% |
반포한강공원 | 5 | 5.0% |
개화산 | 5 | 5.0% |
타임스퀘어 | 5 | 5.0% |
스파랜드 센텀시티 | 4 | 4.0% |
안양일번가 | 4 | 4.0% |
만장굴 | 3 | 3.0% |
Other values (20) | 23 |
Length
Value | Count | Frequency (%) |
남산공원 | 20 | |
강남역거리 | 16 | |
북촌 | 9 | 7.8% |
한옥마을 | 9 | 7.8% |
광교호수공원 | 6 | 5.2% |
반포한강공원 | 5 | 4.3% |
개화산 | 5 | 4.3% |
타임스퀘어 | 5 | 4.3% |
스파랜드 | 4 | 3.5% |
센텀시티 | 4 | 3.5% |
Other values (24) | 32 |
base_ymd
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019-12-09 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019-12-09 |
---|---|
2nd row | 2019-12-09 |
3rd row | 2019-12-09 |
4th row | 2019-12-09 |
5th row | 2019-12-09 |
Common Values
Value | Count | Frequency (%) |
2019-12-09 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019-12-09 | 100 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | hotel_grad | tel_no | homepage_url | trrsrt1_nm | trrsrt2_nm | trrsrt3_nm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
entrp_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
load_addr | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
city_do_cd | 1.000 | 1.000 | 1.000 | 1.000 | 0.824 | 0.942 | 1.000 | 0.200 | 1.000 | 0.000 | 0.990 | 0.990 | 0.990 |
city_gn_gu_cd | 1.000 | 1.000 | 1.000 | 1.000 | 0.820 | 0.940 | 1.000 | 0.209 | 1.000 | 0.000 | 0.988 | 0.988 | 0.988 |
xpos_lo | 1.000 | 1.000 | 0.824 | 0.820 | 1.000 | 0.800 | 0.868 | 0.000 | 1.000 | 0.852 | 0.907 | 0.907 | 0.907 |
ypos_la | 1.000 | 1.000 | 0.942 | 0.940 | 0.800 | 1.000 | 0.987 | 0.000 | 1.000 | 0.613 | 0.979 | 0.979 | 0.979 |
area_nm | 1.000 | 1.000 | 1.000 | 1.000 | 0.868 | 0.987 | 1.000 | 0.272 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
hotel_grad | 1.000 | 1.000 | 0.200 | 0.209 | 0.000 | 0.000 | 0.272 | 1.000 | 1.000 | 0.944 | 0.000 | 0.000 | 0.000 |
tel_no | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
homepage_url | 1.000 | 1.000 | 0.000 | 0.000 | 0.852 | 0.613 | 0.000 | 0.944 | 1.000 | 1.000 | 0.996 | 0.996 | 0.996 |
trrsrt1_nm | 1.000 | 1.000 | 0.990 | 0.988 | 0.907 | 0.979 | 1.000 | 0.000 | 1.000 | 0.996 | 1.000 | 1.000 | 1.000 |
trrsrt2_nm | 1.000 | 1.000 | 0.990 | 0.988 | 0.907 | 0.979 | 1.000 | 0.000 | 1.000 | 0.996 | 1.000 | 1.000 | 1.000 |
trrsrt3_nm | 1.000 | 1.000 | 0.990 | 0.988 | 0.907 | 0.979 | 1.000 | 0.000 | 1.000 | 0.996 | 1.000 | 1.000 | 1.000 |
trrsrt3_nm | area_nm | hotel_grad | trrsrt2_nm | trrsrt1_nm | |
---|---|---|---|---|---|
trrsrt3_nm | 1.000 | 0.838 | 0.000 | 1.000 | 1.000 |
area_nm | 0.838 | 1.000 | 0.102 | 0.838 | 0.838 |
hotel_grad | 0.000 | 0.102 | 1.000 | 0.000 | 0.000 |
trrsrt2_nm | 1.000 | 0.838 | 0.000 | 1.000 | 1.000 |
trrsrt1_nm | 1.000 | 0.838 | 0.000 | 1.000 | 1.000 |
city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | hotel_grad | trrsrt1_nm | trrsrt2_nm | trrsrt3_nm | |
---|---|---|---|---|---|---|---|---|---|
city_do_cd | 1.000 | 0.827 | 0.212 | -0.635 | 0.990 | 0.162 | 0.813 | 0.813 | 0.813 |
city_gn_gu_cd | 0.827 | 1.000 | 0.294 | -0.771 | 0.990 | 0.162 | 0.813 | 0.813 | 0.813 |
xpos_lo | 0.212 | 0.294 | 1.000 | -0.295 | 0.735 | 0.000 | 0.584 | 0.584 | 0.584 |
ypos_la | -0.635 | -0.771 | -0.295 | 1.000 | 0.827 | 0.000 | 0.701 | 0.701 | 0.701 |
area_nm | 0.990 | 0.990 | 0.735 | 0.827 | 1.000 | 0.102 | 0.838 | 0.838 | 0.838 |
hotel_grad | 0.162 | 0.162 | 0.000 | 0.000 | 0.102 | 1.000 | 0.000 | 0.000 | 0.000 |
trrsrt1_nm | 0.813 | 0.813 | 0.584 | 0.701 | 0.838 | 0.000 | 1.000 | 1.000 | 1.000 |
trrsrt2_nm | 0.813 | 0.813 | 0.584 | 0.701 | 0.838 | 0.000 | 1.000 | 1.000 | 1.000 |
trrsrt3_nm | 0.813 | 0.813 | 0.584 | 0.701 | 0.838 | 0.000 | 1.000 | 1.000 | 1.000 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | hotel_grad | tel_no | homepage_url | trrsrt1_nm | trrsrt2_nm | trrsrt3_nm | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 호텔 팝 | 경기도 구리시 안골로57번길 10-6 뉴호스텔모텔 | 41 | 41310 | 127.137499 | 37.597647 | 경기 | <NA> | <NA> | <NA> | 제부도 | 궁평항 | 전곡항 | 2019-12-09 |
1 | 호텔스카이파크제주1호점 | 제주특별자치도 제주시 삼무로 48 | 50 | 50110 | 126.491759 | 33.489376 | 제주 | 3 | 064-797-0000 | https://www.skyparkhotel.com/html/main.asp | 함덕해수욕장 | 우도 | 만장굴 | 2019-12-09 |
2 | 호텔 더메이 | 경기도 남양주시 별내2로 70 호텔 더 메이 | 41 | 41360 | 127.125141 | 37.646659 | 경기 | 3 | 031-551-8700 | <NA> | 물향기수목원 | 유엔군 초전기념관 | 독산성 세마대지 | 2019-12-09 |
3 | JS호텔분당 | 경기도 성남시 분당구 황새울로311번길 36 | 41 | 41135 | 127.121228 | 37.386539 | 경기 | 3 | 1877-8006 | http://www.jshotelbundang.com/ | 바람의언덕 | 외도 | 거제포로수용소유적공원 | 2019-12-09 |
4 | 타소스 호텔 | 경기도 수원시 권선구 권선로 669번길 26 | 41 | 41113 | 127.025264 | 37.260968 | 경기 | <NA> | <NA> | <NA> | 안양예술공원 | 안양중앙공원 | 안양일번가 | 2019-12-09 |
5 | 호매실 호텔 | 경기도 수원시 권선구 금곡로 197번길 17-10 | 41 | 41113 | 126.951318 | 37.274486 | 경기 | 2 | <NA> | <NA> | 안양예술공원 | 안양중앙공원 | 안양일번가 | 2019-12-09 |
6 | 이비스 앰배서더 수원 | 경기도 수원시 팔달구 권광로 132 | 41 | 41115 | 127.031524 | 37.259045 | 경기 | 3 | 031-230-5000 | https://www.ambatel.com/ibis/suwon/ko/main.do | 대부도 | 탄도항 | 안산갈대습지공원 | 2019-12-09 |
7 | 호텔 휘슬락 | 제주특별자치도 제주시 서부두2길 26 | 50 | 50110 | 126.527765 | 33.517558 | 제주 | 4 | 064-795-7000 | http://www.whistlelark.co.kr/ | 함덕해수욕장 | 우도 | 만장굴 | 2019-12-09 |
8 | 호텔 테이트 | 경기도 수원시 팔달구 권광로180번길 53-22 | 41 | 41115 | 127.035025 | 37.263575 | 경기 | 3 | 031-222-6100 | https://hoteltate.modoo.at/ | DMZ | 판문점 | 임진각평화누리공원 | 2019-12-09 |
9 | 알렉스 72 호텔 | 경기도 수원시 팔달구 효원로235번길 35 | 41 | 41115 | 127.027903 | 37.264631 | 경기 | 3 | <NA> | <NA> | 수원화성 | 화성행궁 | 광교호수공원 | 2019-12-09 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | hotel_grad | tel_no | homepage_url | trrsrt1_nm | trrsrt2_nm | trrsrt3_nm | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 서울 로프트 아파트먼트 | 서울특별시 종로구 창경궁로 158 | 11 | 11110 | 126.997611 | 37.57625 | 서울 | 3 | <NA> | <NA> | 서울식물원 | 우장산 | 개화산 | 2019-12-09 |
91 | 호텔 베뉴지 | 서울특별시 종로구 청계천로 117 | 11 | 11110 | 126.990748 | 37.568607 | 서울 | 2 | 02-2223-6500 | http://www.hotelvenueg.com/ | 명동 거리 | N서울타워 | 남산공원 | 2019-12-09 |
92 | 에이퍼스트 호텔 명동 | 서울특별시 중구 다동길 30 | 11 | 11140 | 126.980923 | 37.567642 | 서울 | 3 | 02-768-8777 | http://www.afirsthotelgroup.com/ko/ | 롯데월드 | 서울스카이(제2롯데월드) | 올림픽공원 | 2019-12-09 |
93 | 호텔스카이파크동대문1호점 | 서울특별시 중구 동호로 335 | 11 | 11140 | 127.002105 | 37.564195 | 서울 | 2 | 02-2264-2200 | https://www.skyparkhotel.com/html/main.asp | 명동 거리 | N서울타워 | 남산공원 | 2019-12-09 |
94 | 라마다서울동대문 | 서울특별시 중구 동호로 354 | 11 | 11140 | 127.0028 | 37.5659 | 서울 | 3 | 02-2276-3500 | http://www.ramadaddm.com/main/ | 코엑스 | 가로수길 | 강남역거리 | 2019-12-09 |
95 | 나인트리프리미어호텔명동2 | 서울특별시 중구 마른내로 28 | 11 | 11140 | 126.990819 | 37.564337 | 서울 | 4 | 02-6967-0999 | http://www.ninetreehotels.com/nth2/ | 코엑스 | 가로수길 | 강남역거리 | 2019-12-09 |
96 | 호텔스카이파크명동1호점 | 서울특별시 중구 명동8나길 15 | 11 | 11140 | 126.985293 | 37.564037 | 서울 | 3 | 02-6900-9300 | https://www.skyparkhotel.com/html/main.asp | 명동 거리 | N서울타워 | 남산공원 | 2019-12-09 |
97 | 메트로호텔 | 서울특별시 중구 명동9가길 14 메트로호텔 | 11 | 11140 | 126.985946 | 37.563004 | 서울 | 3 | 02-752-1112 | https://www.metrohotel.co.kr:11031/main.asp | 코엑스 | 가로수길 | 강남역거리 | 2019-12-09 |
98 | 호텔스카이파크센트럴명동점 | 서울특별시 중구 명동9길 16 | 11 | 11140 | 126.985273 | 37.564282 | 서울 | 3 | 02-752-0022 | https://www.skyparkhotel.com/html/main.asp | 명동 거리 | N서울타워 | 남산공원 | 2019-12-09 |
99 | 호텔스카이파크명동2호점 | 서울특별시 중구 명동9길 22 | 11 | 11140 | 126.979467 | 37.568244 | 서울 | 2 | 02-755-0091 | https://www.skyparkhotel.com/html/main.asp | 명동 거리 | N서울타워 | 남산공원 | 2019-12-09 |