Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 100 |
Missing cells | 85 |
Missing cells (%) | 7.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 9.9 KiB |
Average record size in memory | 101.3 B |
Variable types
Categorical | 4 |
---|---|
Text | 5 |
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=5da758b2-a29a-4fab-89e2-6676a19df1b0 |
base_ymd has constant value "" | Constant |
city_gn_gu_cd is highly overall correlated with ypos_la and 1 other fields | High correlation |
xpos_lo is highly overall correlated with city_do_cd | High correlation |
ypos_la is highly overall correlated with city_gn_gu_cd and 1 other fields | High correlation |
city_do_cd is highly overall correlated with city_gn_gu_cd and 2 other fields | High correlation |
city_do_cd is highly imbalanced (68.2%) | Imbalance |
tel_no has 33 (33.0%) missing values | Missing |
homepage_url has 52 (52.0%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 09:45:11.416409 |
---|---|
Analysis finished | 2023-12-10 09:45:15.840019 |
Duration | 4.42 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
subway_line_no
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
5호선 | |
---|---|
4호선 | |
3호선 | |
1호선 | |
6호선 | |
Other values (4) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5호선 |
---|---|
2nd row | 3호선 |
3rd row | 9호선 |
4th row | 5호선 |
5th row | 9호선 |
Common Values
Value | Count | Frequency (%) |
5호선 | 23 | |
4호선 | 17 | |
3호선 | 14 | |
1호선 | 14 | |
6호선 | 11 | |
2호선 | 10 | |
9호선 | 5 | 5.0% |
8호선 | 3 | 3.0% |
7호선 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
5호선 | 23 | |
4호선 | 17 | |
3호선 | 14 | |
1호선 | 14 | |
6호선 | 11 | |
2호선 | 10 | |
9호선 | 5 | 5.0% |
8호선 | 3 | 3.0% |
7호선 | 3 | 3.0% |
Distinct | 64 |
---|---|
Distinct (%) | 64.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
종로3가 | 6 | 6.0% |
시청 | 6 | 6.0% |
충무로 | 6 | 6.0% |
동대문역사문화공원 | 3 | 3.0% |
광화문 | 3 | 3.0% |
경복궁 | 3 | 3.0% |
대공원 | 2 | 2.0% |
이촌 | 2 | 2.0% |
신금호 | 2 | 2.0% |
이태원 | 2 | 2.0% |
Other values (54) | 65 |
Most occurring characters
Value | Count | Frequency (%) |
로 | 17 | 5.3% |
대 | 12 | 3.7% |
가 | 11 | 3.4% |
역 | 11 | 3.4% |
문 | 11 | 3.4% |
원 | 10 | 3.1% |
화 | 9 | 2.8% |
사 | 9 | 2.8% |
동 | 9 | 2.8% |
종 | 8 | 2.5% |
Other values (92) | 215 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 313 | |
Decimal Number | 9 | 2.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 17 | 5.4% |
대 | 12 | 3.8% |
가 | 11 | 3.5% |
역 | 11 | 3.5% |
문 | 11 | 3.5% |
원 | 10 | 3.2% |
화 | 9 | 2.9% |
사 | 9 | 2.9% |
동 | 9 | 2.9% |
종 | 8 | 2.6% |
Other values (89) | 206 |
Decimal Number
Value | Count | Frequency (%) |
3 | 6 | |
4 | 2 | 22.2% |
5 | 1 | 11.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 313 | |
Common | 9 | 2.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 17 | 5.4% |
대 | 12 | 3.8% |
가 | 11 | 3.5% |
역 | 11 | 3.5% |
문 | 11 | 3.5% |
원 | 10 | 3.2% |
화 | 9 | 2.9% |
사 | 9 | 2.9% |
동 | 9 | 2.9% |
종 | 8 | 2.6% |
Other values (89) | 206 |
Common
Value | Count | Frequency (%) |
3 | 6 | |
4 | 2 | 22.2% |
5 | 1 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 313 | |
ASCII | 9 | 2.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
로 | 17 | 5.4% |
대 | 12 | 3.8% |
가 | 11 | 3.5% |
역 | 11 | 3.5% |
문 | 11 | 3.5% |
원 | 10 | 3.2% |
화 | 9 | 2.9% |
사 | 9 | 2.9% |
동 | 9 | 2.9% |
종 | 8 | 2.6% |
Other values (89) | 206 |
ASCII
Value | Count | Frequency (%) |
3 | 6 | |
4 | 2 | 22.2% |
5 | 1 | 11.1% |
tourist_nm
Text
Distinct | 82 |
---|---|
Distinct (%) | 82.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
광희문 | 4 | 3.3% |
러시아 | 3 | 2.5% |
공사관 | 3 | 2.5% |
국립중앙박물관 | 3 | 2.5% |
구 | 3 | 2.5% |
경찰박물관 | 2 | 1.6% |
홀 | 2 | 1.6% |
kbs | 2 | 1.6% |
교보문고 | 2 | 1.6% |
경성일보필동사옥터 | 2 | 1.6% |
Other values (86) | 96 |
Most occurring characters
Value | Count | Frequency (%) |
관 | 27 | 4.2% |
22 | 3.4% | |
국 | 19 | 3.0% |
원 | 17 | 2.7% |
립 | 14 | 2.2% |
산 | 14 | 2.2% |
동 | 13 | 2.0% |
물 | 13 | 2.0% |
시 | 13 | 2.0% |
가 | 12 | 1.9% |
Other values (173) | 477 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 576 | |
Space Separator | 22 | 3.4% |
Uppercase Letter | 15 | 2.3% |
Decimal Number | 11 | 1.7% |
Close Punctuation | 6 | 0.9% |
Open Punctuation | 6 | 0.9% |
Other Punctuation | 5 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
관 | 27 | 4.7% |
국 | 19 | 3.3% |
원 | 17 | 3.0% |
립 | 14 | 2.4% |
산 | 14 | 2.4% |
동 | 13 | 2.3% |
물 | 13 | 2.3% |
시 | 13 | 2.3% |
가 | 12 | 2.1% |
장 | 12 | 2.1% |
Other values (156) | 422 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
B | 3 | |
K | 3 | |
N | 2 | |
G | 2 | |
L | 2 |
Decimal Number
Value | Count | Frequency (%) |
6 | 3 | |
1 | 3 | |
0 | 2 | |
3 | 1 | 9.1% |
9 | 1 | 9.1% |
4 | 1 | 9.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | |
, | 2 |
Space Separator
Value | Count | Frequency (%) |
22 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 576 | |
Common | 50 | 7.8% |
Latin | 15 | 2.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
관 | 27 | 4.7% |
국 | 19 | 3.3% |
원 | 17 | 3.0% |
립 | 14 | 2.4% |
산 | 14 | 2.4% |
동 | 13 | 2.3% |
물 | 13 | 2.3% |
시 | 13 | 2.3% |
가 | 12 | 2.1% |
장 | 12 | 2.1% |
Other values (156) | 422 |
Common
Value | Count | Frequency (%) |
22 | ||
) | 6 | 12.0% |
( | 6 | 12.0% |
6 | 3 | 6.0% |
. | 3 | 6.0% |
1 | 3 | 6.0% |
0 | 2 | 4.0% |
, | 2 | 4.0% |
3 | 1 | 2.0% |
9 | 1 | 2.0% |
Latin
Value | Count | Frequency (%) |
S | 3 | |
B | 3 | |
K | 3 | |
N | 2 | |
G | 2 | |
L | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 576 | |
ASCII | 65 | 10.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
관 | 27 | 4.7% |
국 | 19 | 3.3% |
원 | 17 | 3.0% |
립 | 14 | 2.4% |
산 | 14 | 2.4% |
동 | 13 | 2.3% |
물 | 13 | 2.3% |
시 | 13 | 2.3% |
가 | 12 | 2.1% |
장 | 12 | 2.1% |
Other values (156) | 422 |
ASCII
Value | Count | Frequency (%) |
22 | ||
) | 6 | 9.2% |
( | 6 | 9.2% |
S | 3 | 4.6% |
B | 3 | 4.6% |
K | 3 | 4.6% |
6 | 3 | 4.6% |
. | 3 | 4.6% |
1 | 3 | 4.6% |
N | 2 | 3.1% |
Other values (7) | 11 |
load_addr
Text
Distinct | 76 |
---|---|
Distinct (%) | 76.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 41 |
---|---|
Median length | 28 |
Mean length | 18.4 |
Min length | 11 |
Characters and Unicode
Total characters | 1840 |
---|---|
Distinct characters | 189 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 58 ? |
---|---|
Unique (%) | 58.0% |
Sample
1st row | 서울특별시 영등포구 63로 50 한화금융센터_63 |
---|---|
2nd row | 부산광역시 강서구 대저1동 2158 |
3rd row | 서울특별시 영등포구 여의공원로 13 |
4th row | 서울특별시 영등포구 여의공원로 13 한국방송공사 |
5th row | 서울특별시 강서구 공항대로 376 KBS88체육관 |
Value | Count | Frequency (%) |
서울특별시 | 90 | |
중구 | 25 | 6.3% |
종로구 | 18 | 4.5% |
용산구 | 9 | 2.3% |
강서구 | 7 | 1.8% |
경기도 | 6 | 1.5% |
정동 | 5 | 1.3% |
영등포구 | 5 | 1.3% |
강남구 | 4 | 1.0% |
과천시 | 4 | 1.0% |
Other values (169) | 224 |
Most occurring characters
Value | Count | Frequency (%) |
297 | 16.1% | |
서 | 106 | 5.8% |
시 | 103 | 5.6% |
구 | 97 | 5.3% |
별 | 91 | 4.9% |
특 | 90 | 4.9% |
울 | 90 | 4.9% |
로 | 77 | 4.2% |
동 | 57 | 3.1% |
1 | 46 | 2.5% |
Other values (179) | 786 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1290 | |
Space Separator | 297 | 16.1% |
Decimal Number | 226 | 12.3% |
Dash Punctuation | 9 | 0.5% |
Uppercase Letter | 6 | 0.3% |
Lowercase Letter | 6 | 0.3% |
Open Punctuation | 2 | 0.1% |
Close Punctuation | 2 | 0.1% |
Connector Punctuation | 1 | 0.1% |
Math Symbol | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 106 | 8.2% |
시 | 103 | 8.0% |
구 | 97 | 7.5% |
별 | 91 | 7.1% |
특 | 90 | 7.0% |
울 | 90 | 7.0% |
로 | 77 | 6.0% |
동 | 57 | 4.4% |
중 | 27 | 2.1% |
산 | 21 | 1.6% |
Other values (153) | 531 |
Decimal Number
Value | Count | Frequency (%) |
1 | 46 | |
2 | 35 | |
3 | 33 | |
6 | 23 | |
8 | 21 | |
7 | 19 | |
4 | 19 | |
5 | 12 | 5.3% |
0 | 10 | 4.4% |
9 | 8 | 3.5% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 1 | |
u | 1 | |
e | 1 | |
l | 1 | |
y | 1 | |
t | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 2 | |
S | 2 | |
B | 1 | |
H | 1 |
Space Separator
Value | Count | Frequency (%) |
297 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Math Symbol
Value | Count | Frequency (%) |
~ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1290 | |
Common | 538 | |
Latin | 12 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 106 | 8.2% |
시 | 103 | 8.0% |
구 | 97 | 7.5% |
별 | 91 | 7.1% |
특 | 90 | 7.0% |
울 | 90 | 7.0% |
로 | 77 | 6.0% |
동 | 57 | 4.4% |
중 | 27 | 2.1% |
산 | 21 | 1.6% |
Other values (153) | 531 |
Common
Value | Count | Frequency (%) |
297 | ||
1 | 46 | 8.6% |
2 | 35 | 6.5% |
3 | 33 | 6.1% |
6 | 23 | 4.3% |
8 | 21 | 3.9% |
7 | 19 | 3.5% |
4 | 19 | 3.5% |
5 | 12 | 2.2% |
0 | 10 | 1.9% |
Other values (6) | 23 | 4.3% |
Latin
Value | Count | Frequency (%) |
K | 2 | |
S | 2 | |
B | 1 | |
b | 1 | |
u | 1 | |
H | 1 | |
e | 1 | |
l | 1 | |
y | 1 | |
t | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1290 | |
ASCII | 550 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
297 | ||
1 | 46 | 8.4% |
2 | 35 | 6.4% |
3 | 33 | 6.0% |
6 | 23 | 4.2% |
8 | 21 | 3.8% |
7 | 19 | 3.5% |
4 | 19 | 3.5% |
5 | 12 | 2.2% |
0 | 10 | 1.8% |
Other values (16) | 35 | 6.4% |
Hangul
Value | Count | Frequency (%) |
서 | 106 | 8.2% |
시 | 103 | 8.0% |
구 | 97 | 7.5% |
별 | 91 | 7.1% |
특 | 90 | 7.0% |
울 | 90 | 7.0% |
로 | 77 | 6.0% |
동 | 57 | 4.4% |
중 | 27 | 2.1% |
산 | 21 | 1.6% |
Other values (153) | 531 |
city_do_cd
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
11 | |
---|---|
41 | 7 |
26 | 3 |
44 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 11 |
---|---|
2nd row | 26 |
3rd row | 11 |
4th row | 11 |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
11 | 89 | |
41 | 7 | 7.0% |
26 | 3 | 3.0% |
44 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
11 | 89 | |
41 | 7 | 7.0% |
26 | 3 | 3.0% |
44 | 1 | 1.0% |
city_gn_gu_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13872.95 |
Minimum | 11110 |
---|---|
Maximum | 44200 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11110 |
Q1 | 11140 |
median | 11170 |
Q3 | 11560 |
95-th percentile | 41290 |
Maximum | 44200 |
Range | 33090 |
Interquartile range (IQR) | 420 |
Descriptive statistics
Standard deviation | 8104.6202 |
---|---|
Coefficient of variation (CV) | 0.58420308 |
Kurtosis | 7.6437533 |
Mean | 13872.95 |
Median Absolute Deviation (MAD) | 60 |
Skewness | 3.0182474 |
Sum | 1387295 |
Variance | 65684868 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11140 | 25 | |
11110 | 18 | |
11170 | 9 | 9.0% |
11500 | 6 | 6.0% |
11560 | 5 | 5.0% |
11320 | 5 | 5.0% |
11740 | 4 | 4.0% |
41290 | 4 | 4.0% |
11650 | 3 | 3.0% |
11710 | 3 | 3.0% |
Other values (16) | 18 |
Value | Count | Frequency (%) |
11110 | 18 | |
11140 | 25 | |
11170 | 9 | 9.0% |
11200 | 2 | 2.0% |
11230 | 1 | 1.0% |
11290 | 1 | 1.0% |
11305 | 1 | 1.0% |
11320 | 5 | 5.0% |
11350 | 1 | 1.0% |
11440 | 1 | 1.0% |
Value | Count | Frequency (%) |
44200 | 1 | 1.0% |
41370 | 1 | 1.0% |
41290 | 4 | |
41250 | 1 | 1.0% |
26440 | 1 | 1.0% |
26350 | 1 | 1.0% |
26260 | 1 | 1.0% |
11740 | 4 | |
11710 | 3 | |
11650 | 3 |
xpos_lo
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 74 |
---|---|
Distinct (%) | 74.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.05094 |
Minimum | 126.80359 |
---|---|
Maximum | 129.12247 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 126.80359 |
---|---|
5-th percentile | 126.85343 |
Q1 | 126.97367 |
median | 126.99074 |
Q3 | 127.00921 |
95-th percentile | 127.14606 |
Maximum | 129.12247 |
Range | 2.318887 |
Interquartile range (IQR) | 0.0355415 |
Descriptive statistics
Standard deviation | 0.36147086 |
---|---|
Coefficient of variation (CV) | 0.0028450861 |
Kurtosis | 28.043138 |
Mean | 127.05094 |
Median Absolute Deviation (MAD) | 0.017642 |
Skewness | 5.3304232 |
Sum | 12705.094 |
Variance | 0.13066118 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
126.97314 | 5 | 5.0% |
127.008424 | 4 | 4.0% |
126.994106 | 4 | 4.0% |
126.97995 | 3 | 3.0% |
127.153556 | 2 | 2.0% |
127.009214 | 2 | 2.0% |
126.97385 | 2 | 2.0% |
126.989789 | 2 | 2.0% |
126.969666 | 2 | 2.0% |
126.992458 | 2 | 2.0% |
Other values (64) | 72 |
Value | Count | Frequency (%) |
126.803586 | 1 | |
126.814124 | 1 | |
126.838397 | 1 | |
126.8476 | 1 | |
126.847875 | 1 | |
126.853727 | 1 | |
126.870111 | 1 | |
126.880801 | 1 | |
126.916423 | 2 | |
126.916558 | 1 |
Value | Count | Frequency (%) |
129.122473 | 1 | |
129.096162 | 1 | |
128.971281 | 1 | |
127.153556 | 2 | |
127.14567 | 1 | |
127.130005 | 2 | |
127.123713 | 1 | |
127.111698 | 1 | |
127.07033 | 1 | |
127.064438 | 1 |
ypos_la
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 74 |
---|---|
Distinct (%) | 74.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.468868 |
Minimum | 35.200909 |
---|---|
Maximum | 37.944658 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 35.200909 |
---|---|
5-th percentile | 37.417445 |
Q1 | 37.523724 |
median | 37.558878 |
Q3 | 37.56902 |
95-th percentile | 37.589838 |
Maximum | 37.944658 |
Range | 2.743749 |
Interquartile range (IQR) | 0.045296 |
Descriptive statistics
Standard deviation | 0.41288189 |
---|---|
Coefficient of variation (CV) | 0.011019332 |
Kurtosis | 25.944675 |
Mean | 37.468868 |
Median Absolute Deviation (MAD) | 0.0199885 |
Skewness | -5.1009741 |
Sum | 3746.8868 |
Variance | 0.17047145 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.566617 | 5 | 5.0% |
37.563945 | 4 | 4.0% |
37.558878 | 4 | 4.0% |
37.523724 | 3 | 3.0% |
37.53926 | 2 | 2.0% |
37.477749 | 2 | 2.0% |
37.499973 | 2 | 2.0% |
37.564247 | 2 | 2.0% |
37.56902 | 2 | 2.0% |
37.561089 | 2 | 2.0% |
Other values (64) | 72 |
Value | Count | Frequency (%) |
35.200909 | 1 | |
35.210222 | 1 | |
35.214377 | 1 | |
36.767899 | 1 | |
37.159031 | 1 | |
37.431046 | 1 | |
37.433683 | 1 | |
37.433971 | 1 | |
37.438934 | 1 | |
37.471431 | 1 |
Value | Count | Frequency (%) |
37.944658 | 1 | |
37.672997 | 1 | |
37.665292 | 1 | |
37.649042 | 1 | |
37.593195 | 1 | |
37.589661 | 1 | |
37.587123 | 1 | |
37.582365 | 1 | |
37.58236 | 1 | |
37.58094 | 1 |
tel_no
Text
MISSING
 
Distinct | 52 |
---|---|
Distinct (%) | 77.6% |
Missing | 33 |
Missing (%) | 33.0% |
Memory size | 932.0 B |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 11.373134 |
Min length | 6 |
Characters and Unicode
Total characters | 762 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 40 ? |
---|---|
Unique (%) | 59.7% |
Sample
1st row | 1833-7001 |
---|---|
2nd row | 02-781-1000 |
3rd row | 02-781-1000 |
4th row | 02-2600-8808 |
5th row | 02-3773-1053 |
Value | Count | Frequency (%) |
02-3369-5882 | 4 | 6.0% |
02-2077-9000 | 3 | 4.5% |
1544-1900 | 2 | 3.0% |
02-3150-3681 | 2 | 3.0% |
02-3425-5252 | 2 | 3.0% |
02-472-2770 | 2 | 3.0% |
02-813-9625 | 2 | 3.0% |
02-2261-0517 | 2 | 3.0% |
02-3455-9277 | 2 | 3.0% |
02-781-1000 | 2 | 3.0% |
Other values (42) | 44 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 153 | |
- | 130 | |
2 | 119 | |
3 | 55 | 7.2% |
7 | 55 | 7.2% |
1 | 54 | 7.1% |
5 | 50 | 6.6% |
8 | 45 | 5.9% |
4 | 38 | 5.0% |
9 | 33 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 632 | |
Dash Punctuation | 130 | 17.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 153 | |
2 | 119 | |
3 | 55 | 8.7% |
7 | 55 | 8.7% |
1 | 54 | 8.5% |
5 | 50 | 7.9% |
8 | 45 | 7.1% |
4 | 38 | 6.0% |
9 | 33 | 5.2% |
6 | 30 | 4.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 130 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 762 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 153 | |
- | 130 | |
2 | 119 | |
3 | 55 | 7.2% |
7 | 55 | 7.2% |
1 | 54 | 7.1% |
5 | 50 | 6.6% |
8 | 45 | 5.9% |
4 | 38 | 5.0% |
9 | 33 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 762 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 153 | |
- | 130 | |
2 | 119 | |
3 | 55 | 7.2% |
7 | 55 | 7.2% |
1 | 54 | 7.1% |
5 | 50 | 6.6% |
8 | 45 | 5.9% |
4 | 38 | 5.0% |
9 | 33 | 4.3% |
homepage_url
Text
MISSING
 
Distinct | 41 |
---|---|
Distinct (%) | 85.4% |
Missing | 52 |
Missing (%) | 52.0% |
Memory size | 932.0 B |
Length
Max length | 66 |
---|---|
Median length | 33 |
Mean length | 27.645833 |
Min length | 16 |
Characters and Unicode
Total characters | 1327 |
---|---|
Distinct characters | 45 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 35 ? |
---|---|
Unique (%) | 72.9% |
Sample
1st row | http://www.63city.co.kr/ |
---|---|
2nd row | http://www.kbs.co.kr/ |
3rd row | http://office.kbs.co.kr/kbshall/ |
4th row | http://www.kbssw.co.kr/ |
5th row | http://www.lgscience.co.kr/ |
Value | Count | Frequency (%) |
http://www.museum.go.kr | 3 | 6.2% |
http://hanokmaeul.or.kr | 2 | 4.2% |
http://www.nanta.co.kr | 2 | 4.2% |
http://parks.seoul.go.kr/gildong | 2 | 4.2% |
http://www.nseoultower.com | 2 | 4.2% |
http://www.policemuseum.go.kr | 2 | 4.2% |
http://www.ntok.go.kr | 1 | 2.1% |
http://www.63city.co.kr | 1 | 2.1% |
http://www.gugak.go.kr | 1 | 2.1% |
http://www.nfm.go.kr | 1 | 2.1% |
Other values (31) | 31 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 143 | 10.8% |
. | 134 | 10.1% |
t | 110 | 8.3% |
w | 110 | 8.3% |
o | 87 | 6.6% |
r | 66 | 5.0% |
k | 66 | 5.0% |
h | 53 | 4.0% |
p | 53 | 4.0% |
e | 51 | 3.8% |
Other values (35) | 454 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 977 | |
Other Punctuation | 324 | 24.4% |
Decimal Number | 13 | 1.0% |
Uppercase Letter | 9 | 0.7% |
Math Symbol | 2 | 0.2% |
Connector Punctuation | 1 | 0.1% |
Dash Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 110 | 11.3% |
w | 110 | 11.3% |
o | 87 | 8.9% |
r | 66 | 6.8% |
k | 66 | 6.8% |
h | 53 | 5.4% |
p | 53 | 5.4% |
e | 51 | 5.2% |
a | 50 | 5.1% |
g | 44 | 4.5% |
Other values (15) | 287 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | |
0 | 3 | |
5 | 2 | |
6 | 1 | 7.7% |
3 | 1 | 7.7% |
4 | 1 | 7.7% |
9 | 1 | 7.7% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 143 | |
. | 134 | |
: | 44 | 13.6% |
? | 2 | 0.6% |
& | 1 | 0.3% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
I | 3 | |
M | 1 | 11.1% |
T | 1 | 11.1% |
E | 1 | 11.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 986 | |
Common | 341 | 25.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 110 | 11.2% |
w | 110 | 11.2% |
o | 87 | 8.8% |
r | 66 | 6.7% |
k | 66 | 6.7% |
h | 53 | 5.4% |
p | 53 | 5.4% |
e | 51 | 5.2% |
a | 50 | 5.1% |
g | 44 | 4.5% |
Other values (20) | 296 |
Common
Value | Count | Frequency (%) |
/ | 143 | |
. | 134 | |
: | 44 | 12.9% |
1 | 4 | 1.2% |
0 | 3 | 0.9% |
= | 2 | 0.6% |
? | 2 | 0.6% |
5 | 2 | 0.6% |
_ | 1 | 0.3% |
6 | 1 | 0.3% |
Other values (5) | 5 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1327 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 143 | 10.8% |
. | 134 | 10.1% |
t | 110 | 8.3% |
w | 110 | 8.3% |
o | 87 | 6.6% |
r | 66 | 5.0% |
k | 66 | 5.0% |
h | 53 | 4.0% |
p | 53 | 4.0% |
e | 51 | 3.8% |
Other values (35) | 454 |
sales_tm
Categorical
Distinct | 20 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
10:00~18:00 | |
00:00~24:00 | 5 |
09:30~17:30 | 4 |
10:00~23:00 | 3 |
Other values (15) |
Length
Max length | 13 |
---|---|
Median length | 4 |
Mean length | 6.89 |
Min length | 4 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 9.0% |
Sample
1st row | 10:00~22:00 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 59 | |
10:00~18:00 | 6 | 6.0% |
00:00~24:00 | 5 | 5.0% |
09:30~17:30 | 4 | 4.0% |
10:00~23:00 | 3 | 3.0% |
09:00~18:00 | 3 | 3.0% |
10:00~16:00 | 3 | 3.0% |
09:30~22:00 | 2 | 2.0% |
06:00~18:00 | 2 | 2.0% |
09:00~17:30 | 2 | 2.0% |
Other values (10) | 11 | 11.0% |
Length
Value | Count | Frequency (%) |
na | 59 | |
10:00~18:00 | 6 | 5.9% |
00:00~24:00 | 5 | 4.9% |
09:30~17:30 | 4 | 3.9% |
10:00~23:00 | 3 | 2.9% |
09:00~18:00 | 3 | 2.9% |
10:00~16:00 | 3 | 2.9% |
09:30~22:00 | 2 | 2.0% |
06:00~18:00 | 2 | 2.0% |
09:00~17:30 | 2 | 2.0% |
Other values (12) | 13 | 12.7% |
base_ymd
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2020-12-31 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-12-31 |
---|---|
2nd row | 2020-12-31 |
3rd row | 2020-12-31 |
4th row | 2020-12-31 |
5th row | 2020-12-31 |
Common Values
Value | Count | Frequency (%) |
2020-12-31 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-12-31 | 100 |
subway_line_no | subway_station_nm | tourist_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | tel_no | homepage_url | sales_tm | |
---|---|---|---|---|---|---|---|---|---|---|---|
subway_line_no | 1.000 | 0.976 | 0.373 | 0.711 | 0.000 | 0.316 | 0.595 | 0.082 | 0.000 | 0.747 | 0.534 |
subway_station_nm | 0.976 | 1.000 | 0.997 | 0.997 | 0.999 | 1.000 | 0.995 | 0.988 | 0.979 | 0.983 | 0.896 |
tourist_nm | 0.373 | 0.997 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.997 | 1.000 |
load_addr | 0.711 | 0.997 | 1.000 | 1.000 | 0.998 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 0.998 |
city_do_cd | 0.000 | 0.999 | 1.000 | 0.998 | 1.000 | 0.892 | 0.673 | 0.864 | 1.000 | 1.000 | 0.000 |
city_gn_gu_cd | 0.316 | 1.000 | 1.000 | 1.000 | 0.892 | 1.000 | 0.941 | 0.788 | 1.000 | 1.000 | 0.000 |
xpos_lo | 0.595 | 0.995 | 1.000 | 1.000 | 0.673 | 0.941 | 1.000 | 0.766 | 1.000 | 1.000 | 0.692 |
ypos_la | 0.082 | 0.988 | 1.000 | 1.000 | 0.864 | 0.788 | 0.766 | 1.000 | 1.000 | 1.000 | 0.000 |
tel_no | 0.000 | 0.979 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
homepage_url | 0.747 | 0.983 | 0.997 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
sales_tm | 0.534 | 0.896 | 1.000 | 0.998 | 0.000 | 0.000 | 0.692 | 0.000 | 1.000 | 1.000 | 1.000 |
subway_line_no | city_do_cd | sales_tm | |
---|---|---|---|
subway_line_no | 1.000 | 0.000 | 0.164 |
city_do_cd | 0.000 | 1.000 | 0.000 |
sales_tm | 0.164 | 0.000 | 1.000 |
city_gn_gu_cd | xpos_lo | ypos_la | subway_line_no | city_do_cd | sales_tm | |
---|---|---|---|---|---|---|
city_gn_gu_cd | 1.000 | 0.212 | -0.701 | 0.137 | 0.960 | 0.000 |
xpos_lo | 0.212 | 1.000 | -0.154 | 0.310 | 0.700 | 0.462 |
ypos_la | -0.701 | -0.154 | 1.000 | 0.036 | 0.844 | 0.000 |
subway_line_no | 0.137 | 0.310 | 0.036 | 1.000 | 0.000 | 0.164 |
city_do_cd | 0.960 | 0.700 | 0.844 | 0.000 | 1.000 | 0.000 |
sales_tm | 0.000 | 0.462 | 0.000 | 0.164 | 0.000 | 1.000 |
subway_line_no | subway_station_nm | tourist_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | tel_no | homepage_url | sales_tm | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 5호선 | 여의나루 | 63 시티 | 서울특별시 영등포구 63로 50 한화금융센터_63 | 11 | 11560 | 126.939757 | 37.519576 | 1833-7001 | http://www.63city.co.kr/ | 10:00~22:00 | 2020-12-31 |
1 | 3호선 | 체육공원 | 강서체육공원 | 부산광역시 강서구 대저1동 2158 | 26 | 26440 | 128.971281 | 35.210222 | <NA> | <NA> | <NA> | 2020-12-31 |
2 | 9호선 | 국회의사당 | KBS | 서울특별시 영등포구 여의공원로 13 | 11 | 11560 | 126.916423 | 37.525591 | 02-781-1000 | http://www.kbs.co.kr/ | <NA> | 2020-12-31 |
3 | 5호선 | 여의도 | KBS 홀 | 서울특별시 영등포구 여의공원로 13 한국방송공사 | 11 | 11560 | 126.916423 | 37.525591 | 02-781-1000 | http://office.kbs.co.kr/kbshall/ | <NA> | 2020-12-31 |
4 | 9호선 | 가양 | KBS스포츠월드 | 서울특별시 강서구 공항대로 376 KBS88체육관 | 11 | 11500 | 126.8476 | 37.556809 | 02-2600-8808 | http://www.kbssw.co.kr/ | <NA> | 2020-12-31 |
5 | 5호선 | 여의나루 | LG 사이언스 홀 | 서울특별시 영등포구 여의대로 128 | 11 | 11560 | 126.928961 | 37.527804 | 02-3773-1053 | http://www.lgscience.co.kr/ | 09:00~17:30 | 2020-12-31 |
6 | 2호선 | 역삼 | LG아트센터 | 서울특별시 강남구 논현로 508 | 11 | 11320 | 127.037619 | 37.502141 | 02-2005-0114 | http://www.lgart.com/ | <NA> | 2020-12-31 |
7 | 4호선 | 충렬사 | 충렬사 | 부산광역시 동래구 충렬대로 347 | 26 | 26260 | 129.096162 | 35.200909 | <NA> | <NA> | <NA> | 2020-12-31 |
8 | 4호선 | 명동 | N서울타워 | 서울특별시 용산구 남산공원길 126 | 11 | 11170 | 126.990751 | 37.550857 | 02-3455-9277 | http://www.nseoultower.com/ | 10:00~23:00 | 2020-12-31 |
9 | 8호선 | 장지역 | 가든파이브 | 서울특별시 송파구 충민로 66 가든파이브라이프 | 11 | 11710 | 127.123713 | 37.477635 | 02-2157-0100 | http://www.garden5.com/ | 10:30~21:00 | 2020-12-31 |
subway_line_no | subway_station_nm | tourist_nm | load_addr | city_do_cd | city_gn_gu_cd | xpos_lo | ypos_la | tel_no | homepage_url | sales_tm | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 5호선 | 종로3가 | 낙원떡, 악기상가 | 서울특별시 종로구 낙원동 | 11 | 11110 | 126.988624 | 37.572581 | <NA> | <NA> | <NA> | 2020-12-31 |
91 | 1호선 | 시청 | 난타전용극장 | 서울특별시 중구 명동길 26 유네스코회관 | 11 | 11140 | 126.983767 | 37.563465 | 02-739-8288 | http://www.nanta.co.kr/ | <NA> | 2020-12-31 |
92 | 2호선 | 시청 | 난타전용극장 | 서울특별시 중구 명동길 26 유네스코회관 | 11 | 11140 | 126.983767 | 37.563465 | 02-739-8288 | http://www.nanta.co.kr/ | <NA> | 2020-12-31 |
93 | 4호선 | 회현 | 남대문 | 서울특별시 중구 세종대로 40 | 11 | 11140 | 126.975326 | 37.559923 | 042-481-4650 | <NA> | <NA> | 2020-12-31 |
94 | 4호선 | 회현 | 남대문시장 | 서울특별시 중구 남대문시장4길 21 | 11 | 11140 | 126.977724 | 37.559223 | 02-753-2805 | http://namdaemunmarket.co.kr/ | 00:00~23:00 | 2020-12-31 |
95 | 6호선 | 한강진역 | 남산(N서울타워) | 서울특별시 용산구 남산공원길 126 | 11 | 11170 | 126.990751 | 37.550857 | 02-3455-9277 | http://www.nseoultower.com/ | 10:00~23:00 | 2020-12-31 |
96 | 4호선 | 충무로 | 남산골 한옥마을 | 서울특별시 중구 퇴계로34길 28 남산골한옥마을 | 11 | 11140 | 126.994106 | 37.558878 | 02-2261-0517 | http://hanokmaeul.or.kr/ | 09:00~21:00 | 2020-12-31 |
97 | 3호선 | 충무로 | 남산골한옥마을 | 서울특별시 중구 퇴계로34길 28 남산골한옥마을 | 11 | 11140 | 126.994106 | 37.558878 | 02-2261-0517 | http://hanokmaeul.or.kr/ | 09:00~21:00 | 2020-12-31 |
98 | 6호선 | 한강진역 | 남산야외식물원 | 서울특별시 용산구 소월로 323 | 11 | 11170 | 126.997021 | 37.541539 | 02-798-3771 | <NA> | 00:00~24:00 | 2020-12-31 |
99 | 4호선 | 명동 | 남산케이블카 | 서울특별시 중구 소파로 83 | 11 | 11140 | 126.983995 | 37.55661 | 02-753-2403 | http://www.cablecar.co.kr/ | 10:00~23:00 | 2020-12-31 |