Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.8 KiB |
Average record size in memory | 69.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 5 |
Dataset
Description | Sample |
---|---|
Author | 원투씨엠 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=e5714b70-3600-11ec-bbc0-d7035fffebeb |
visit_area_gugun_klang_nm is highly overall correlated with seq_no and 5 other fields | High correlation |
crtfc_str_nm is highly overall correlated with seq_no and 5 other fields | High correlation |
visit_area_addr is highly overall correlated with seq_no and 5 other fields | High correlation |
seq_no is highly overall correlated with cstmr_id and 4 other fields | High correlation |
goods_online_sle_dt is highly overall correlated with str_visit_dt and 5 other fields | High correlation |
str_visit_dt is highly overall correlated with goods_online_sle_dt and 5 other fields | High correlation |
cstmr_id is highly overall correlated with seq_no and 3 other fields | High correlation |
cstmr_visit_co is highly overall correlated with seq_no and 6 other fields | High correlation |
cstmr_visit_co is highly imbalanced (64.8%) | Imbalance |
seq_no has unique values | Unique |
str_visit_dt has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:51:40.344582 |
---|---|
Analysis finished | 2023-12-10 09:51:43.528337 |
Duration | 3.18 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
seq_no
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 441.82 |
Minimum | 1 |
---|---|
Maximum | 13059 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6.95 |
Q1 | 27.75 |
median | 53.5 |
Q3 | 78.25 |
95-th percentile | 98.05 |
Maximum | 13059 |
Range | 13058 |
Interquartile range (IQR) | 50.5 |
Descriptive statistics
Standard deviation | 2230.0765 |
---|---|
Coefficient of variation (CV) | 5.0474774 |
Kurtosis | 29.887261 |
Mean | 441.82 |
Median Absolute Deviation (MAD) | 25.5 |
Skewness | 5.5932224 |
Sum | 44182 |
Variance | 4973241 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
9 | 1 | |
10 | 1 | |
11 | 1 | |
12 | 1 |
Value | Count | Frequency (%) |
13059 | 1 | |
13058 | 1 | |
13057 | 1 | |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 |
cstmr_id
Categorical
HIGH CORRELATION
 
Distinct | 36 |
---|---|
Distinct (%) | 36.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
601363384001 | 3 |
---|---|
601370148001 | 3 |
601370236001 | 3 |
601353398001 | 3 |
601372470001 | 3 |
Other values (31) |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 11.82 |
Min length | 6 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 601353091001 |
---|---|
2nd row | nankai |
3rd row | 601353398001 |
4th row | 601353398001 |
5th row | 601353398001 |
Common Values
Value | Count | Frequency (%) |
601363384001 | 3 | 3.0% |
601370148001 | 3 | 3.0% |
601370236001 | 3 | 3.0% |
601353398001 | 3 | 3.0% |
601372470001 | 3 | 3.0% |
601356277001 | 3 | 3.0% |
601356543001 | 3 | 3.0% |
601356831001 | 3 | 3.0% |
601356864001 | 3 | 3.0% |
601357175001 | 3 | 3.0% |
Other values (26) | 70 |
Length
Value | Count | Frequency (%) |
601363384001 | 3 | 3.0% |
601372041001 | 3 | 3.0% |
601362459001 | 3 | 3.0% |
nankai | 3 | 3.0% |
601370938001 | 3 | 3.0% |
601366768001 | 3 | 3.0% |
601366963001 | 3 | 3.0% |
601369074001 | 3 | 3.0% |
601368097001 | 3 | 3.0% |
601368128001 | 3 | 3.0% |
Other values (26) | 70 |
cstmr_visit_co
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
3 | |
---|---|
2 | 7 |
44 | 3 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.03 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 44 |
3rd row | 3 |
4th row | 3 |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
3 | 90 | |
2 | 7 | 7.0% |
44 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3 | 90 | |
2 | 7 | 7.0% |
44 | 3 | 3.0% |
visit_area_addr
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
京都市左京区田中上柳町25-3 | |
---|---|
京都市中京区壬生賀陽御所町3番地20 | |
京都市左京区上高野東山55 | |
大阪府泉南市泉南郡田尻町空港中1 | 3 |
Length
Max length | 18 |
---|---|
Median length | 16 |
Mean length | 15.37 |
Min length | 13 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 京都市左京区田中上柳町25-3 |
---|---|
2nd row | 大阪府泉南市泉南郡田尻町空港中1 |
3rd row | 京都市左京区田中上柳町25-3 |
4th row | 京都市中京区壬生賀陽御所町3番地20 |
5th row | 京都市左京区上高野東山55 |
Common Values
Value | Count | Frequency (%) |
京都市左京区田中上柳町25-3 | 34 | |
京都市中京区壬生賀陽御所町3番地20 | 32 | |
京都市左京区上高野東山55 | 31 | |
大阪府泉南市泉南郡田尻町空港中1 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
京都市左京区田中上柳町25-3 | 34 | |
京都市中京区壬生賀陽御所町3番地20 | 32 | |
京都市左京区上高野東山55 | 31 | |
大阪府泉南市泉南郡田尻町空港中1 | 3 | 3.0% |
visit_area_gugun_klang_nm
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
교토부 교토시 사쿄구 | |
---|---|
교토부 교토시 나카교구 | |
오사카부 센난시 센난군 | 3 |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 11.35 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 교토부 교토시 사쿄구 |
---|---|
2nd row | 오사카부 센난시 센난군 |
3rd row | 교토부 교토시 사쿄구 |
4th row | 교토부 교토시 나카교구 |
5th row | 교토부 교토시 사쿄구 |
Common Values
Value | Count | Frequency (%) |
교토부 교토시 사쿄구 | 65 | |
교토부 교토시 나카교구 | 32 | |
오사카부 센난시 센난군 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
교토부 | 97 | |
교토시 | 97 | |
사쿄구 | 65 | |
나카교구 | 32 | 10.7% |
오사카부 | 3 | 1.0% |
센난시 | 3 | 1.0% |
센난군 | 3 | 1.0% |
crtfc_str_nm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
叡山電鉄 | |
---|---|
京福電気鉄道(鋼索係事務所) | |
京都八瀬 瑠璃光院 | |
n・e・s・t関西空港店 | 3 |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 8.99 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 叡山電鉄 |
---|---|
2nd row | n・e・s・t関西空港店 |
3rd row | 叡山電鉄 |
4th row | 京福電気鉄道(鋼索係事務所) |
5th row | 京都八瀬 瑠璃光院 |
Common Values
Value | Count | Frequency (%) |
叡山電鉄 | 34 | |
京福電気鉄道(鋼索係事務所) | 32 | |
京都八瀬 瑠璃光院 | 31 | |
n・e・s・t関西空港店 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
叡山電鉄 | 34 | |
京福電気鉄道(鋼索係事務所) | 32 | |
京都八瀬 | 31 | |
瑠璃光院 | 31 | |
n・e・s・t関西空港店 | 3 | 2.3% |
goods_online_sle_dt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 38 |
---|---|
Distinct (%) | 38.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0201098 × 1013 |
Minimum | 2.0200323 × 1013 |
---|---|
Maximum | 2.0201129 × 1013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2.0200323 × 1013 |
---|---|
5-th percentile | 2.0201107 × 1013 |
Q1 | 2.0201119 × 1013 |
median | 2.0201122 × 1013 |
Q3 | 2.0201126 × 1013 |
95-th percentile | 2.0201129 × 1013 |
Maximum | 2.0201129 × 1013 |
Range | 8.0599147 × 108 |
Interquartile range (IQR) | 7106105 |
Descriptive statistics
Standard deviation | 1.3681937 × 108 |
---|---|
Coefficient of variation (CV) | 6.7728682 × 10-6 |
Kurtosis | 29.763069 |
Mean | 2.0201098 × 1013 |
Median Absolute Deviation (MAD) | 4015088.5 |
Skewness | -5.5764023 |
Sum | 2.0201098 × 1015 |
Variance | 1.8719541 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20201126142137 | 3 | 3.0% |
20201125130833 | 3 | 3.0% |
20201107162024 | 3 | 3.0% |
20201115093330 | 3 | 3.0% |
20201129154728 | 3 | 3.0% |
20201122083130 | 3 | 3.0% |
20201124164212 | 3 | 3.0% |
20201129051422 | 3 | 3.0% |
20201107165743 | 3 | 3.0% |
20201129084316 | 3 | 3.0% |
Other values (28) | 70 |
Value | Count | Frequency (%) |
20200323163255 | 1 | 1.0% |
20200323163347 | 1 | 1.0% |
20200327085236 | 1 | 1.0% |
20201107000014 | 1 | 1.0% |
20201107000936 | 2 | |
20201107162024 | 3 | |
20201107165743 | 3 | |
20201115084721 | 3 | |
20201115093330 | 3 | |
20201115102122 | 3 |
Value | Count | Frequency (%) |
20201129154728 | 3 | |
20201129113023 | 3 | |
20201129105711 | 3 | |
20201129084316 | 3 | |
20201129051422 | 3 | |
20201128174335 | 3 | |
20201128153016 | 3 | |
20201128134322 | 2 | |
20201126170421 | 3 | |
20201126142137 | 3 |
str_visit_dt
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0201098 × 1013 |
Minimum | 2.0200323 × 1013 |
---|---|
Maximum | 2.0201129 × 1013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2.0200323 × 1013 |
---|---|
5-th percentile | 2.0201107 × 1013 |
Q1 | 2.0201119 × 1013 |
median | 2.0201122 × 1013 |
Q3 | 2.0201126 × 1013 |
95-th percentile | 2.0201129 × 1013 |
Maximum | 2.0201129 × 1013 |
Range | 8.0603949 × 108 |
Interquartile range (IQR) | 7003040.2 |
Descriptive statistics
Standard deviation | 1.3682629 × 108 |
---|---|
Coefficient of variation (CV) | 6.7732104 × 10-6 |
Kurtosis | 29.763359 |
Mean | 2.0201098 × 1013 |
Median Absolute Deviation (MAD) | 4007443 |
Skewness | -5.5764406 |
Sum | 2.0201098 × 1015 |
Variance | 1.8721433 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20201107160042 | 1 | 1.0% |
20201107182138 | 1 | 1.0% |
20201122185818 | 1 | 1.0% |
20201122172931 | 1 | 1.0% |
20201122165700 | 1 | 1.0% |
20201124180939 | 1 | 1.0% |
20201124170647 | 1 | 1.0% |
20201124164250 | 1 | 1.0% |
20201129203114 | 1 | 1.0% |
20201129192358 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
20200323163628 | 1 | |
20200323163643 | 1 | |
20200327100113 | 1 | |
20201107160042 | 1 | |
20201107162039 | 1 | |
20201107165929 | 1 | |
20201107175053 | 1 | |
20201107175349 | 1 | |
20201107182106 | 1 | |
20201107182138 | 1 |
Value | Count | Frequency (%) |
20201129203114 | 1 | |
20201129192358 | 1 | |
20201129190148 | 1 | |
20201129184748 | 1 | |
20201129184430 | 1 | |
20201129183159 | 1 | |
20201129180222 | 1 | |
20201129180137 | 1 | |
20201129172741 | 1 | |
20201129172458 | 1 |
seq_no | cstmr_id | cstmr_visit_co | visit_area_addr | visit_area_gugun_klang_nm | crtfc_str_nm | goods_online_sle_dt | str_visit_dt | |
---|---|---|---|---|---|---|---|---|
seq_no | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.919 | 0.919 |
cstmr_id | 1.000 | 1.000 | 1.000 | 0.260 | 0.768 | 0.260 | 1.000 | 1.000 |
cstmr_visit_co | 1.000 | 1.000 | 1.000 | 0.671 | 0.942 | 0.671 | 1.000 | 1.000 |
visit_area_addr | 1.000 | 0.260 | 0.671 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
visit_area_gugun_klang_nm | 1.000 | 0.768 | 0.942 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
crtfc_str_nm | 1.000 | 0.260 | 0.671 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
goods_online_sle_dt | 0.919 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.692 |
str_visit_dt | 0.919 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.692 | 1.000 |
visit_area_gugun_klang_nm | cstmr_visit_co | crtfc_str_nm | visit_area_addr | cstmr_id | |
---|---|---|---|---|---|
visit_area_gugun_klang_nm | 1.000 | 0.704 | 0.995 | 0.995 | 0.410 |
cstmr_visit_co | 0.704 | 1.000 | 0.697 | 0.697 | 0.812 |
crtfc_str_nm | 0.995 | 0.697 | 1.000 | 1.000 | 0.084 |
visit_area_addr | 0.995 | 0.697 | 1.000 | 1.000 | 0.084 |
cstmr_id | 0.410 | 0.812 | 0.084 | 0.084 | 1.000 |
seq_no | goods_online_sle_dt | str_visit_dt | cstmr_id | cstmr_visit_co | visit_area_addr | visit_area_gugun_klang_nm | crtfc_str_nm | |
---|---|---|---|---|---|---|---|---|
seq_no | 1.000 | -0.085 | -0.092 | 0.808 | 0.995 | 0.990 | 0.995 | 0.990 |
goods_online_sle_dt | -0.085 | 1.000 | 0.979 | 0.808 | 0.995 | 0.990 | 0.995 | 0.990 |
str_visit_dt | -0.092 | 0.979 | 1.000 | 0.808 | 0.995 | 0.990 | 0.995 | 0.990 |
cstmr_id | 0.808 | 0.808 | 0.808 | 1.000 | 0.812 | 0.084 | 0.410 | 0.084 |
cstmr_visit_co | 0.995 | 0.995 | 0.995 | 0.812 | 1.000 | 0.697 | 0.704 | 0.697 |
visit_area_addr | 0.990 | 0.990 | 0.990 | 0.084 | 0.697 | 1.000 | 0.995 | 1.000 |
visit_area_gugun_klang_nm | 0.995 | 0.995 | 0.995 | 0.410 | 0.704 | 0.995 | 1.000 | 0.995 |
crtfc_str_nm | 0.990 | 0.990 | 0.990 | 0.084 | 0.697 | 1.000 | 0.995 | 1.000 |
seq_no | cstmr_id | cstmr_visit_co | visit_area_addr | visit_area_gugun_klang_nm | crtfc_str_nm | goods_online_sle_dt | str_visit_dt | |
---|---|---|---|---|---|---|---|---|
0 | 1 | 601353091001 | 2 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201107000014 | 20201107160042 |
1 | 13057 | nankai | 44 | 大阪府泉南市泉南郡田尻町空港中1 | 오사카부 센난시 센난군 | n・e・s・t関西空港店 | 20200323163347 | 20200323163628 |
2 | 3 | 601353398001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201122140131 | 20201122164123 |
3 | 4 | 601353398001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201122140131 | 20201122171416 |
4 | 5 | 601353398001 | 3 | 京都市左京区上高野東山55 | 교토부 교토시 사쿄구 | 京都八瀬 瑠璃光院 | 20201122140131 | 20201122183215 |
5 | 6 | 601355962001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201107000936 | 20201107175349 |
6 | 7 | 601355962001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201107000936 | 20201107182106 |
7 | 13058 | nankai | 44 | 大阪府泉南市泉南郡田尻町空港中1 | 오사카부 센난시 센난군 | n・e・s・t関西空港店 | 20200323163255 | 20200323163643 |
8 | 9 | 601356277001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201126170421 | 20201126171610 |
9 | 10 | 601356277001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201126170421 | 20201126180033 |
seq_no | cstmr_id | cstmr_visit_co | visit_area_addr | visit_area_gugun_klang_nm | crtfc_str_nm | goods_online_sle_dt | str_visit_dt | |
---|---|---|---|---|---|---|---|---|
90 | 91 | 601371180001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201128174335 | 20201128182007 |
91 | 92 | 601371180001 | 3 | 京都市左京区上高野東山55 | 교토부 교토시 사쿄구 | 京都八瀬 瑠璃光院 | 20201128174335 | 20201128192305 |
92 | 93 | 601372041001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201129113023 | 20201129172458 |
93 | 94 | 601372041001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201129113023 | 20201129180222 |
94 | 95 | 601372041001 | 3 | 京都市左京区上高野東山55 | 교토부 교토시 사쿄구 | 京都八瀬 瑠璃光院 | 20201129113023 | 20201129184430 |
95 | 96 | 601372470001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201115102122 | 20201115160100 |
96 | 97 | 601372470001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201115102122 | 20201115162705 |
97 | 98 | 601372470001 | 3 | 京都市左京区上高野東山55 | 교토부 교토시 사쿄구 | 京都八瀬 瑠璃光院 | 20201115102122 | 20201115182237 |
98 | 99 | 601372976001 | 3 | 京都市左京区田中上柳町25-3 | 교토부 교토시 사쿄구 | 叡山電鉄 | 20201122142250 | 20201122154934 |
99 | 100 | 601372976001 | 3 | 京都市中京区壬生賀陽御所町3番地20 | 교토부 교토시 나카교구 | 京福電気鉄道(鋼索係事務所) | 20201122142250 | 20201122161319 |