Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 100 |
Missing cells | 12 |
Missing cells (%) | 1.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.0 KiB |
Average record size in memory | 102.3 B |
Variable types
Text | 4 |
---|---|
Numeric | 5 |
Categorical | 3 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=a8920dd0-43e4-11ea-a53b-35560a26d0cc |
city_do_cd is highly overall correlated with city_gn_gu_cd and 2 other fields | High correlation |
city_gn_gu_cd is highly overall correlated with city_do_cd and 2 other fields | High correlation |
xpos_lo is highly overall correlated with area_nm | High correlation |
ypos_la is highly overall correlated with city_do_cd and 2 other fields | High correlation |
area_nm is highly overall correlated with city_do_cd and 3 other fields | High correlation |
base_ymd is highly imbalanced (85.9%) | Imbalance |
homepage_url has 4 (4.0%) missing values | Missing |
chtt_stsfdg_rt has 6 (6.0%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 09:57:26.231917 |
---|---|
Analysis finished | 2023-12-10 09:57:36.792758 |
Duration | 10.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
entrp_nm
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
호텔 | 77 | |
라마다 | 11 | 3.8% |
제주 | 9 | 3.1% |
서울 | 7 | 2.4% |
더 | 5 | 1.7% |
부산 | 5 | 1.7% |
베니키아 | 5 | 1.7% |
동대문 | 4 | 1.4% |
프리미어 | 3 | 1.0% |
골든 | 3 | 1.0% |
Other values (133) | 159 |
Most occurring characters
Value | Count | Frequency (%) |
188 | ||
호 | 87 | 9.0% |
텔 | 87 | 9.0% |
스 | 36 | 3.7% |
리 | 24 | 2.5% |
트 | 21 | 2.2% |
라 | 18 | 1.9% |
마 | 17 | 1.8% |
아 | 15 | 1.6% |
다 | 13 | 1.3% |
Other values (179) | 459 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 754 | |
Space Separator | 188 | 19.5% |
Uppercase Letter | 18 | 1.9% |
Lowercase Letter | 4 | 0.4% |
Decimal Number | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 87 | 11.5% |
텔 | 87 | 11.5% |
스 | 36 | 4.8% |
리 | 24 | 3.2% |
트 | 21 | 2.8% |
라 | 18 | 2.4% |
마 | 17 | 2.3% |
아 | 15 | 2.0% |
다 | 13 | 1.7% |
이 | 12 | 1.6% |
Other values (161) | 424 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 4 | |
H | 2 | |
D | 2 | |
G | 2 | |
J | 1 | 5.6% |
A | 1 | 5.6% |
N | 1 | 5.6% |
E | 1 | 5.6% |
M | 1 | 5.6% |
Y | 1 | 5.6% |
Other values (2) | 2 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 1 | |
u | 1 | |
t | 1 | |
e | 1 |
Space Separator
Value | Count | Frequency (%) |
188 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 754 | |
Common | 189 | 19.6% |
Latin | 22 | 2.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 87 | 11.5% |
텔 | 87 | 11.5% |
스 | 36 | 4.8% |
리 | 24 | 3.2% |
트 | 21 | 2.8% |
라 | 18 | 2.4% |
마 | 17 | 2.3% |
아 | 15 | 2.0% |
다 | 13 | 1.7% |
이 | 12 | 1.6% |
Other values (161) | 424 |
Latin
Value | Count | Frequency (%) |
S | 4 | |
H | 2 | 9.1% |
D | 2 | 9.1% |
G | 2 | 9.1% |
J | 1 | 4.5% |
i | 1 | 4.5% |
u | 1 | 4.5% |
A | 1 | 4.5% |
N | 1 | 4.5% |
E | 1 | 4.5% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
188 | ||
1 | 1 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 754 | |
ASCII | 211 | 21.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
188 | ||
S | 4 | 1.9% |
H | 2 | 0.9% |
D | 2 | 0.9% |
G | 2 | 0.9% |
J | 1 | 0.5% |
i | 1 | 0.5% |
u | 1 | 0.5% |
A | 1 | 0.5% |
N | 1 | 0.5% |
Other values (8) | 8 | 3.8% |
Hangul
Value | Count | Frequency (%) |
호 | 87 | 11.5% |
텔 | 87 | 11.5% |
스 | 36 | 4.8% |
리 | 24 | 3.2% |
트 | 21 | 2.8% |
라 | 18 | 2.4% |
마 | 17 | 2.3% |
아 | 15 | 2.0% |
다 | 13 | 1.7% |
이 | 12 | 1.6% |
Other values (161) | 424 |
load_addr
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 43 |
---|---|
Median length | 27 |
Mean length | 21.18 |
Min length | 14 |
Characters and Unicode
Total characters | 2118 |
---|---|
Distinct characters | 195 |
Distinct categories | 5 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 98 ? |
---|---|
Unique (%) | 98.0% |
Sample
1st row | 서울특별시 성북구 동소문로20나길 39 동선동 복합빌 |
---|---|
2nd row | 서울특별시 중구 세종대로11길 36 |
3rd row | 서울특별시 용산구 이태원로15길 14-4 |
4th row | 서울특별시 중구 장충단로 226 |
5th row | 부산광역시 해운대구 해운대해변로 271 |
Value | Count | Frequency (%) |
서울특별시 | 46 | 10.3% |
중구 | 16 | 3.6% |
제주특별자치도 | 13 | 2.9% |
경기도 | 12 | 2.7% |
부산광역시 | 11 | 2.5% |
강남구 | 7 | 1.6% |
서귀포시 | 7 | 1.6% |
제주시 | 6 | 1.3% |
인천광역시 | 6 | 1.3% |
해운대구 | 6 | 1.3% |
Other values (259) | 317 |
Most occurring characters
Value | Count | Frequency (%) |
347 | 16.4% | |
시 | 96 | 4.5% |
로 | 87 | 4.1% |
구 | 76 | 3.6% |
1 | 74 | 3.5% |
서 | 68 | 3.2% |
특 | 59 | 2.8% |
별 | 59 | 2.8% |
울 | 51 | 2.4% |
2 | 49 | 2.3% |
Other values (185) | 1152 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1386 | |
Space Separator | 347 | 16.4% |
Decimal Number | 340 | 16.1% |
Dash Punctuation | 23 | 1.1% |
Uppercase Letter | 22 | 1.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 96 | 6.9% |
로 | 87 | 6.3% |
구 | 76 | 5.5% |
서 | 68 | 4.9% |
특 | 59 | 4.3% |
별 | 59 | 4.3% |
울 | 51 | 3.7% |
길 | 45 | 3.2% |
도 | 37 | 2.7% |
대 | 30 | 2.2% |
Other values (161) | 778 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 3 | |
I | 3 | |
H | 2 | |
L | 2 | |
E | 2 | |
T | 2 | |
D | 2 | |
N | 2 | |
S | 1 | 4.5% |
R | 1 | 4.5% |
Other values (2) | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 74 | |
2 | 49 | |
3 | 40 | |
4 | 32 | |
5 | 29 | 8.5% |
6 | 28 | 8.2% |
0 | 25 | 7.4% |
9 | 23 | 6.8% |
7 | 22 | 6.5% |
8 | 18 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
347 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 23 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1386 | |
Common | 710 | |
Latin | 22 | 1.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 96 | 6.9% |
로 | 87 | 6.3% |
구 | 76 | 5.5% |
서 | 68 | 4.9% |
특 | 59 | 4.3% |
별 | 59 | 4.3% |
울 | 51 | 3.7% |
길 | 45 | 3.2% |
도 | 37 | 2.7% |
대 | 30 | 2.2% |
Other values (161) | 778 |
Common
Value | Count | Frequency (%) |
347 | ||
1 | 74 | 10.4% |
2 | 49 | 6.9% |
3 | 40 | 5.6% |
4 | 32 | 4.5% |
5 | 29 | 4.1% |
6 | 28 | 3.9% |
0 | 25 | 3.5% |
- | 23 | 3.2% |
9 | 23 | 3.2% |
Other values (2) | 40 | 5.6% |
Latin
Value | Count | Frequency (%) |
G | 3 | |
I | 3 | |
H | 2 | |
L | 2 | |
E | 2 | |
T | 2 | |
D | 2 | |
N | 2 | |
S | 1 | 4.5% |
R | 1 | 4.5% |
Other values (2) | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1386 | |
ASCII | 732 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
347 | ||
1 | 74 | 10.1% |
2 | 49 | 6.7% |
3 | 40 | 5.5% |
4 | 32 | 4.4% |
5 | 29 | 4.0% |
6 | 28 | 3.8% |
0 | 25 | 3.4% |
- | 23 | 3.1% |
9 | 23 | 3.1% |
Other values (14) | 62 | 8.5% |
Hangul
Value | Count | Frequency (%) |
시 | 96 | 6.9% |
로 | 87 | 6.3% |
구 | 76 | 5.5% |
서 | 68 | 4.9% |
특 | 59 | 4.3% |
별 | 59 | 4.3% |
울 | 51 | 3.7% |
길 | 45 | 3.2% |
도 | 37 | 2.7% |
대 | 30 | 2.2% |
Other values (161) | 778 |
city_do_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.18 |
Minimum | 11 |
---|---|
Maximum | 50 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 11 |
median | 26 |
Q3 | 41 |
95-th percentile | 50 |
Maximum | 50 |
Range | 39 |
Interquartile range (IQR) | 30 |
Descriptive statistics
Standard deviation | 15.647235 |
---|---|
Coefficient of variation (CV) | 0.59767895 |
Kurtosis | -1.5802603 |
Mean | 26.18 |
Median Absolute Deviation (MAD) | 15 |
Skewness | 0.32344257 |
Sum | 2618 |
Variance | 244.83596 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 46 | |
50 | 13 | 13.0% |
41 | 12 | 12.0% |
26 | 11 | 11.0% |
28 | 6 | 6.0% |
42 | 5 | 5.0% |
47 | 2 | 2.0% |
44 | 2 | 2.0% |
48 | 1 | 1.0% |
45 | 1 | 1.0% |
Value | Count | Frequency (%) |
11 | 46 | |
26 | 11 | 11.0% |
28 | 6 | 6.0% |
31 | 1 | 1.0% |
41 | 12 | 12.0% |
42 | 5 | 5.0% |
44 | 2 | 2.0% |
45 | 1 | 1.0% |
47 | 2 | 2.0% |
48 | 1 | 1.0% |
Value | Count | Frequency (%) |
50 | 13 | |
48 | 1 | 1.0% |
47 | 2 | 2.0% |
45 | 1 | 1.0% |
44 | 2 | 2.0% |
42 | 5 | 5.0% |
41 | 12 | |
31 | 1 | 1.0% |
28 | 6 | |
26 | 11 |
city_gn_gu_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 45 |
---|---|
Distinct (%) | 45.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26482.51 |
Minimum | 11110 |
---|---|
Maximum | 50130 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11140 |
Q1 | 11440 |
median | 26245 |
Q3 | 41590 |
95-th percentile | 50130 |
Maximum | 50130 |
Range | 39020 |
Interquartile range (IQR) | 30150 |
Descriptive statistics
Standard deviation | 15580.245 |
---|---|
Coefficient of variation (CV) | 0.58832205 |
Kurtosis | -1.583092 |
Mean | 26482.51 |
Median Absolute Deviation (MAD) | 14955 |
Skewness | 0.32392091 |
Sum | 2648251 |
Variance | 2.4274402 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11140 | 12 | 12.0% |
50130 | 7 | 7.0% |
11680 | 7 | 7.0% |
26350 | 6 | 6.0% |
50110 | 6 | 6.0% |
11500 | 5 | 5.0% |
41115 | 4 | 4.0% |
11230 | 3 | 3.0% |
28110 | 3 | 3.0% |
11290 | 3 | 3.0% |
Other values (35) | 44 |
Value | Count | Frequency (%) |
11110 | 2 | 2.0% |
11140 | 12 | |
11170 | 3 | 3.0% |
11230 | 3 | 3.0% |
11290 | 3 | 3.0% |
11305 | 1 | 1.0% |
11440 | 2 | 2.0% |
11500 | 5 | |
11545 | 2 | 2.0% |
11560 | 2 | 2.0% |
Value | Count | Frequency (%) |
50130 | 7 | |
50110 | 6 | |
48170 | 1 | 1.0% |
47940 | 1 | 1.0% |
47130 | 1 | 1.0% |
45190 | 1 | 1.0% |
44200 | 1 | 1.0% |
44130 | 1 | 1.0% |
42830 | 1 | 1.0% |
42760 | 1 | 1.0% |
xpos_lo
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 98 |
---|---|
Distinct (%) | 99.0% |
Missing | 1 |
Missing (%) | 1.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.32792 |
Minimum | 126.37151 |
---|---|
Maximum | 130.8702 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 126.37151 |
---|---|
5-th percentile | 126.51837 |
Q1 | 126.88479 |
median | 127.00776 |
Q3 | 127.10342 |
95-th percentile | 129.16147 |
Maximum | 130.8702 |
Range | 4.4986868 |
Interquartile range (IQR) | 0.2186288 |
Descriptive statistics
Standard deviation | 0.90473109 |
---|---|
Coefficient of variation (CV) | 0.0071055199 |
Kurtosis | 2.0986307 |
Mean | 127.32792 |
Median Absolute Deviation (MAD) | 0.1143925 |
Skewness | 1.7121336 |
Sum | 12605.464 |
Variance | 0.81853835 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.0975344 | 2 | 2.0% |
126.6602847 | 1 | 1.0% |
127.1212276 | 1 | 1.0% |
129.1613998 | 1 | 1.0% |
129.0373247 | 1 | 1.0% |
127.032221 | 1 | 1.0% |
129.0818704 | 1 | 1.0% |
126.6569041 | 1 | 1.0% |
127.0301916 | 1 | 1.0% |
126.5984422 | 1 | 1.0% |
Other values (88) | 88 |
Value | Count | Frequency (%) |
126.3715088 | 1 | |
126.4080885 | 1 | |
126.4837714 | 1 | |
126.5033589 | 1 | |
126.5102947 | 1 | |
126.519265 | 1 | |
126.519934 | 1 | |
126.5218499 | 1 | |
126.5674246 | 1 | |
126.578421 | 1 |
Value | Count | Frequency (%) |
130.8701956 | 1 | |
129.3473935 | 1 | |
129.2774041 | 1 | |
129.1646681 | 1 | |
129.1620947 | 1 | |
129.1613998 | 1 | |
129.1611261 | 1 | |
129.1545873 | 1 | |
129.1328184 | 1 | |
129.0818704 | 1 |
ypos_la
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 98 |
---|---|
Distinct (%) | 99.0% |
Missing | 1 |
Missing (%) | 1.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.608501 |
Minimum | 33.249384 |
---|---|
Maximum | 38.189922 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 33.249384 |
---|---|
5-th percentile | 33.432346 |
Q1 | 35.313758 |
median | 37.478877 |
Q3 | 37.55706 |
95-th percentile | 37.592834 |
Maximum | 38.189922 |
Range | 4.9405375 |
Interquartile range (IQR) | 2.2433019 |
Descriptive statistics
Standard deviation | 1.4953849 |
---|---|
Coefficient of variation (CV) | 0.040848023 |
Kurtosis | 0.13635974 |
Mean | 36.608501 |
Median Absolute Deviation (MAD) | 0.09233732 |
Skewness | -1.2882885 |
Sum | 3624.2416 |
Variance | 2.2361759 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.50601088 | 2 | 2.0% |
33.54282848 | 1 | 1.0% |
37.38653939 | 1 | 1.0% |
35.16012514 | 1 | 1.0% |
35.09382162 | 1 | 1.0% |
37.2637485 | 1 | 1.0% |
35.21946372 | 1 | 1.0% |
33.54420799 | 1 | 1.0% |
37.50647548 | 1 | 1.0% |
37.47435003 | 1 | 1.0% |
Other values (88) | 88 |
Value | Count | Frequency (%) |
33.24938412 | 1 | |
33.24986732 | 1 | |
33.25142402 | 1 | |
33.25447711 | 1 | |
33.254517 | 1 | |
33.45210482 | 1 | |
33.46598674 | 1 | |
33.4857485 | 1 | |
33.50067983 | 1 | |
33.51287223 | 1 |
Value | Count | Frequency (%) |
38.18992163 | 1 | |
38.11488063 | 1 | |
37.65025711 | 1 | |
37.64757519 | 1 | |
37.5938887 | 1 | |
37.59271709 | 1 | |
37.58871323 | 1 | |
37.57617521 | 1 | |
37.57446681 | 1 | |
37.5696537 | 1 |
area_nm
Categorical
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 12.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 | |
---|---|
제주 | |
경기 | |
부산 | |
인천 | |
Other values (7) |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.01 |
Min length | 2 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 부산 |
Common Values
Value | Count | Frequency (%) |
서울 | 46 | |
제주 | 13 | 13.0% |
경기 | 12 | 12.0% |
부산 | 11 | 11.0% |
인천 | 6 | 6.0% |
강원 | 5 | 5.0% |
충남 | 2 | 2.0% |
경남 | 1 | 1.0% |
경북 | 1 | 1.0% |
전북 | 1 | 1.0% |
Other values (2) | 2 | 2.0% |
Length
Value | Count | Frequency (%) |
서울 | 46 | |
제주 | 13 | 13.0% |
경기 | 12 | 12.0% |
부산 | 11 | 11.0% |
인천 | 6 | 6.0% |
강원 | 5 | 5.0% |
충남 | 2 | 2.0% |
경남 | 1 | 1.0% |
경북 | 1 | 1.0% |
전북 | 1 | 1.0% |
Other values (2) | 2 | 2.0% |
homepage_url
Text
MISSING
 
Distinct | 94 |
---|---|
Distinct (%) | 97.9% |
Missing | 4 |
Missing (%) | 4.0% |
Memory size | 932.0 B |
Length
Max length | 50 |
---|---|
Median length | 32 |
Mean length | 25.208333 |
Min length | 9 |
Characters and Unicode
Total characters | 2420 |
---|---|
Distinct characters | 41 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 92 ? |
---|---|
Unique (%) | 95.8% |
Sample
1st row | www.dhnaissance.com |
---|---|
2nd row | www.enahotel.co.kr |
3rd row | gv-residence.com |
4th row | http://seouldongdaemun.splaisir.com |
5th row | mshotel.alltheway.kr |
Value | Count | Frequency (%) |
https://www.ramadaencorejejuseogwipo.com | 2 | 2.1% |
www.dhnaissance.com | 2 | 2.1% |
www.hotelthem.com | 1 | 1.0% |
www.riverpark.co.kr | 1 | 1.0% |
http://www.suwonhotel.co.kr/kor | 1 | 1.0% |
www.jshotelbundang.com | 1 | 1.0% |
www.busanbusinesshotel.com | 1 | 1.0% |
www.vellasuitehotel.co.kr | 1 | 1.0% |
http://www.bestincityhotel.co.kr | 1 | 1.0% |
www.hotel-bestone.com | 1 | 1.0% |
Other values (84) | 84 |
Most occurring characters
Value | Count | Frequency (%) |
o | 246 | 10.2% |
w | 231 | 9.5% |
. | 213 | 8.8% |
t | 189 | 7.8% |
e | 181 | 7.5% |
a | 135 | 5.6% |
h | 123 | 5.1% |
c | 114 | 4.7% |
/ | 102 | 4.2% |
l | 101 | 4.2% |
Other values (31) | 785 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2052 | |
Other Punctuation | 350 | 14.5% |
Dash Punctuation | 7 | 0.3% |
Uppercase Letter | 6 | 0.2% |
Decimal Number | 5 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 246 | |
w | 231 | |
t | 189 | 9.2% |
e | 181 | 8.8% |
a | 135 | 6.6% |
h | 123 | 6.0% |
c | 114 | 5.6% |
l | 101 | 4.9% |
m | 97 | 4.7% |
n | 97 | 4.7% |
Other values (16) | 538 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 2 | |
K | 1 | |
R | 1 | |
M | 1 | |
G | 1 |
Decimal Number
Value | Count | Frequency (%) |
3 | 1 | |
9 | 1 | |
5 | 1 | |
0 | 1 | |
2 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 213 | |
/ | 102 | |
: | 34 | 9.7% |
? | 1 | 0.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2058 | |
Common | 362 | 15.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 246 | |
w | 231 | |
t | 189 | 9.2% |
e | 181 | 8.8% |
a | 135 | 6.6% |
h | 123 | 6.0% |
c | 114 | 5.5% |
l | 101 | 4.9% |
m | 97 | 4.7% |
n | 97 | 4.7% |
Other values (21) | 544 |
Common
Value | Count | Frequency (%) |
. | 213 | |
/ | 102 | |
: | 34 | 9.4% |
- | 7 | 1.9% |
3 | 1 | 0.3% |
9 | 1 | 0.3% |
5 | 1 | 0.3% |
0 | 1 | 0.3% |
2 | 1 | 0.3% |
? | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2420 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 246 | 10.2% |
w | 231 | 9.5% |
. | 213 | 8.8% |
t | 189 | 7.8% |
e | 181 | 7.5% |
a | 135 | 5.6% |
h | 123 | 5.1% |
c | 114 | 4.7% |
/ | 102 | 4.2% |
l | 101 | 4.2% |
Other values (31) | 785 |
hotel_grad
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
3급 | |
---|---|
특2급 | |
1급 | 6 |
2급 | 3 |
특1급 | 2 |
Other values (2) | 2 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.22 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 3급 |
---|---|
2nd row | 특2급 |
3rd row | 3급 |
4th row | 특2급 |
5th row | 3급 |
Common Values
Value | Count | Frequency (%) |
3급 | 69 | |
특2급 | 18 | 18.0% |
1급 | 6 | 6.0% |
2급 | 3 | 3.0% |
특1급 | 2 | 2.0% |
<NA> | 1 | 1.0% |
4급 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3급 | 69 | |
특2급 | 18 | 18.0% |
1급 | 6 | 6.0% |
2급 | 3 | 3.0% |
특1급 | 2 | 2.0% |
na | 1 | 1.0% |
4급 | 1 | 1.0% |
tel_no
Text
Distinct | 98 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 11.77 |
Min length | 9 |
Characters and Unicode
Total characters | 1177 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 96 ? |
---|---|
Unique (%) | 96.0% |
Sample
1st row | 02-921-2080 |
---|---|
2nd row | 02-6020-7000 |
3rd row | 02-797-5800 |
4th row | 02-2198-1212 |
5th row | 051-741-3838 |
Value | Count | Frequency (%) |
02-921-2080 | 2 | 2.0% |
064-735-2000 | 2 | 2.0% |
051-720-9000 | 1 | 1.0% |
02-2277-4917 | 1 | 1.0% |
031-236-7112 | 1 | 1.0% |
051-243-8001 | 1 | 1.0% |
051-808-2000 | 1 | 1.0% |
031-231-2121 | 1 | 1.0% |
051-464-8883 | 1 | 1.0% |
064-731-3700 | 1 | 1.0% |
Other values (88) | 88 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 291 | |
- | 199 | |
2 | 137 | |
1 | 111 | 9.4% |
3 | 89 | 7.6% |
7 | 77 | 6.5% |
5 | 69 | 5.9% |
6 | 61 | 5.2% |
4 | 54 | 4.6% |
9 | 49 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 978 | |
Dash Punctuation | 199 | 16.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 291 | |
2 | 137 | |
1 | 111 | 11.3% |
3 | 89 | 9.1% |
7 | 77 | 7.9% |
5 | 69 | 7.1% |
6 | 61 | 6.2% |
4 | 54 | 5.5% |
9 | 49 | 5.0% |
8 | 40 | 4.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 199 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1177 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 291 | |
- | 199 | |
2 | 137 | |
1 | 111 | 9.4% |
3 | 89 | 7.6% |
7 | 77 | 6.5% |
5 | 69 | 5.9% |
6 | 61 | 5.2% |
4 | 54 | 4.6% |
9 | 49 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1177 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 291 | |
- | 199 | |
2 | 137 | |
1 | 111 | 9.4% |
3 | 89 | 7.6% |
7 | 77 | 6.5% |
5 | 69 | 5.9% |
6 | 61 | 5.2% |
4 | 54 | 4.6% |
9 | 49 | 4.2% |
chtt_stsfdg_rt
Real number (ℝ)
MISSING
 
Distinct | 28 |
---|---|
Distinct (%) | 29.8% |
Missing | 6 |
Missing (%) | 6.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.3244681 |
Minimum | 2.1 |
---|---|
Maximum | 4.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2.1 |
---|---|
5-th percentile | 2.1 |
Q1 | 2.425 |
median | 3.25 |
Q3 | 3.9 |
95-th percentile | 4.7 |
Maximum | 4.9 |
Range | 2.8 |
Interquartile range (IQR) | 1.475 |
Descriptive statistics
Standard deviation | 0.87346623 |
---|---|
Coefficient of variation (CV) | 0.26273864 |
Kurtosis | -1.3074575 |
Mean | 3.3244681 |
Median Absolute Deviation (MAD) | 0.75 |
Skewness | 0.14724286 |
Sum | 312.5 |
Variance | 0.76294326 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2.3 | 9 | 9.0% |
3.9 | 7 | 7.0% |
3.7 | 7 | 7.0% |
2.1 | 6 | 6.0% |
4.6 | 6 | 6.0% |
3.2 | 5 | 5.0% |
2.4 | 5 | 5.0% |
2.2 | 4 | 4.0% |
4.5 | 4 | 4.0% |
2.8 | 4 | 4.0% |
Other values (18) | 37 | |
(Missing) | 6 | 6.0% |
Value | Count | Frequency (%) |
2.1 | 6 | |
2.2 | 4 | |
2.3 | 9 | |
2.4 | 5 | |
2.5 | 3 | 3.0% |
2.6 | 3 | 3.0% |
2.7 | 2 | 2.0% |
2.8 | 4 | |
2.9 | 2 | 2.0% |
3.0 | 1 | 1.0% |
Value | Count | Frequency (%) |
4.9 | 1 | 1.0% |
4.8 | 2 | 2.0% |
4.7 | 3 | |
4.6 | 6 | |
4.5 | 4 | |
4.3 | 1 | 1.0% |
4.2 | 3 | |
4.1 | 2 | 2.0% |
4.0 | 1 | 1.0% |
3.9 | 7 |
base_ymd
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019-12-09 | |
---|---|
2020-12-31 | 2 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019-12-09 |
---|---|
2nd row | 2019-12-09 |
3rd row | 2019-12-09 |
4th row | 2019-12-09 |
5th row | 2019-12-09 |
Common Values
Value | Count | Frequency (%) |
2019-12-09 | 98 | |
2020-12-31 | 2 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019-12-09 | 98 | |
2020-12-31 | 2 | 2.0% |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | hotel_grad | tel_no | chtt_stsfdg_rt | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
entrp_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
load_addr | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
city_do_cd | 1.000 | 1.000 | 1.000 | 0.999 | 0.731 | 0.947 | 1.000 | 1.000 | 0.369 | 1.000 | 0.313 | 0.000 |
city_gn_gu_cd | 1.000 | 1.000 | 0.999 | 1.000 | 0.754 | 0.934 | 0.987 | 1.000 | 0.146 | 1.000 | 0.336 | 0.000 |
xpos_lo | 1.000 | 1.000 | 0.731 | 0.754 | 1.000 | 0.747 | 0.928 | 1.000 | 0.438 | 1.000 | 0.000 | 0.000 |
ypos_la | 1.000 | 1.000 | 0.947 | 0.934 | 0.747 | 1.000 | 0.956 | 1.000 | 0.495 | 1.000 | 0.000 | 0.000 |
area_nm | 1.000 | 1.000 | 1.000 | 0.987 | 0.928 | 0.956 | 1.000 | 1.000 | 0.840 | 1.000 | 0.219 | 0.000 |
homepage_url | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.976 | 0.000 |
hotel_grad | 1.000 | 1.000 | 0.369 | 0.146 | 0.438 | 0.495 | 0.840 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
tel_no | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.976 | 0.000 |
chtt_stsfdg_rt | 1.000 | 1.000 | 0.313 | 0.336 | 0.000 | 0.000 | 0.219 | 0.976 | 0.000 | 0.976 | 1.000 | 0.000 |
base_ymd | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
hotel_grad | area_nm | base_ymd | |
---|---|---|---|
hotel_grad | 1.000 | 0.474 | 0.000 |
area_nm | 0.474 | 1.000 | 0.000 |
base_ymd | 0.000 | 0.000 | 1.000 |
city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | chtt_stsfdg_rt | area_nm | hotel_grad | base_ymd | |
---|---|---|---|---|---|---|---|---|
city_do_cd | 1.000 | 0.949 | -0.070 | -0.705 | -0.214 | 0.973 | 0.227 | 0.000 |
city_gn_gu_cd | 0.949 | 1.000 | -0.040 | -0.711 | -0.182 | 0.946 | 0.086 | 0.000 |
xpos_lo | -0.070 | -0.040 | 1.000 | 0.029 | 0.106 | 0.705 | 0.257 | 0.000 |
ypos_la | -0.705 | -0.711 | 0.029 | 1.000 | 0.152 | 0.852 | 0.320 | 0.000 |
chtt_stsfdg_rt | -0.214 | -0.182 | 0.106 | 0.152 | 1.000 | 0.100 | 0.000 | 0.000 |
area_nm | 0.973 | 0.946 | 0.705 | 0.852 | 0.100 | 1.000 | 0.474 | 0.000 |
hotel_grad | 0.227 | 0.086 | 0.257 | 0.320 | 0.000 | 0.474 | 1.000 | 0.000 |
base_ymd | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | hotel_grad | tel_no | chtt_stsfdg_rt | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | DH 네상스 호텔 | 서울특별시 성북구 동소문로20나길 39 동선동 복합빌 | 11 | 11290 | 127.097534 | 37.506011 | 서울 | www.dhnaissance.com | 3급 | 02-921-2080 | 2.3 | 2019-12-09 |
1 | ENA Suite 호텔 | 서울특별시 중구 세종대로11길 36 | 11 | 11140 | 126.977968 | 37.567946 | 서울 | www.enahotel.co.kr | 특2급 | 02-6020-7000 | 2.8 | 2019-12-09 |
2 | GV 레지던스 | 서울특별시 용산구 이태원로15길 14-4 | 11 | 11170 | 126.972322 | 37.54084 | 서울 | gv-residence.com | 3급 | 02-797-5800 | 4.7 | 2019-12-09 |
3 | KY 헤리티지 호텔 | 서울특별시 중구 장충단로 226 | 11 | 11140 | 127.00105 | 37.550703 | 서울 | http://seouldongdaemun.splaisir.com | 특2급 | 02-2198-1212 | 2.5 | 2019-12-09 |
4 | MS 호텔 | 부산광역시 해운대구 해운대해변로 271 | 26 | 26350 | 129.164668 | 35.161447 | 부산 | mshotel.alltheway.kr | 3급 | 051-741-3838 | 4.1 | 2019-12-09 |
5 | SG 관광 호텔 | 인천광역시 서구 탁옥로51번길 13-9 SG관광호텔 | 28 | 28260 | 126.67434 | 37.545036 | 인천 | sghotel.kr | 3급 | 032-562-0512 | 3.9 | 2019-12-09 |
6 | 가야 라트리 호텔 | 서울특별시 용산구 한강대로 253 가야라트리 호텔 | 11 | 11170 | 126.972141 | 37.541439 | 서울 | kayalatreehotel.com | 3급 | 02-798-5101 | 3.4 | 2019-12-09 |
7 | 강남 아르누보 씨티 | 서울특별시 서초구 서초대로74길 49 | 11 | 11650 | 127.018784 | 37.593889 | 서울 | www.gnanhotel.com | 3급 | 02-580-7500 | 3.9 | 2019-12-09 |
8 | 강남 패밀리 호텔 | 서울특별시 강남구 봉은사로 143 운현오피스텔 | 11 | 11680 | 127.045279 | 37.510347 | 서울 | www.gangnamfamilyhotel.com | 3급 | 02-6474-1515 | 4.6 | 2019-12-09 |
9 | 골드 리버 호텔 | 서울특별시 금천구 서부샛길 584 | 11 | 11545 | 127.026073 | 37.576175 | 서울 | goldriverhotel.co.kr | 3급 | 02-6021-8100 | 2.6 | 2019-12-09 |
entrp_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | area_nm | homepage_url | hotel_grad | tel_no | chtt_stsfdg_rt | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 이 천안호텔 | 충청남도 천안시 서북구 성정동 734-2 | 44 | 44130 | 127.140241 | 36.81241 | 충남 | http://www.cheonanhotel.kr/ | 2급 | 041-592-0000 | 3.7 | 2019-12-09 |
91 | 제주 라마다 앙코르 이스트 호텔 | 제주특별자치도 서귀포시 서호동 | 50 | 50130 | 126.519934 | 33.254517 | 제주 | https://www.ramadaencorejejuseogwipo.com/ | 3급 | 064-735-2000 | 2.6 | 2019-12-09 |
92 | 제주 하나 호텔 | 제주특별자치도 서귀포시 중문관광로72번길 53 | 50 | 50130 | 126.408089 | 33.249867 | 제주 | http://www.hotelhana.co.kr/main/ | 특2급 | 064-738-7001 | 4.2 | 2019-12-09 |
93 | 코업 시티 호텔 성산 | 제주특별자치도 서귀포시 성산읍 성산등용로 28 | 50 | 50130 | 126.931927 | 33.465987 | 제주 | https://www.coopcityhotel-seongsan.co.kr/ | 3급 | 064-780-9800 | 2.1 | 2019-12-09 |
94 | 호텔 노블레스 제주 | 제주특별자치도 제주시 월성로4길 19 | 50 | 50110 | 126.503359 | 33.50068 | 제주 | hotelnoblessejeju.modoo.at | 3급 | 064-748-7161 | 2.7 | 2019-12-09 |
95 | 호텔 로베로 | 제주특별자치도 제주시 관덕로 26 | 50 | 50110 | 126.52185 | 33.512872 | 제주 | stazhoteljejurobero.com/ | 1급 | 064-757-7111 | 4.8 | 2019-12-09 |
96 | 호텔 위드 제주 | 제주특별자치도 제주시 노연로 34 | 50 | 50110 | 126.483771 | 33.485748 | 제주 | www.hotelwithjeju.com/ | 특2급 | 02-522-5873 | 2.4 | 2019-12-09 |
97 | 힐리언스선마을 | 강원도 홍천군 서면 종자산길 122 | 42 | 42720 | 127.630802 | 37.650257 | 강원 | https://www.healience.co.kr/ | 3급 | 033-434-2772 | 2.3 | 2019-12-09 |
98 | DH 네상스 호텔 | 서울특별시 성북구 동소문로20나길 39 동선동 복합빌 | 11 | 11290 | 127.097534 | 37.506011 | 서울 | www.dhnaissance.com | 3급 | 02-921-2080 | 2.3 | 2020-12-31 |
99 | 코리아나호텔 | 서울특별시 중구 태평로1가 세종대로 135 | 11 | 11140 | <NA> | <NA> | 서울 | https://www.koreanahotel.com/index.htm? | 특2급 | 02-2171-7000 | 3.0 | 2020-12-31 |