Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.4 KiB |
Average record size in memory | 55.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 5 |
Dataset
Description | Sample |
---|---|
Author | 부산정보산업진흥원 (Busan IT Industry Promotion Agency) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=d7325e20-c23e-4f75-9101-5868ee8e361b |
base_month is highly overall correlated with ticket_dt and 2 other fields | High correlation |
ticket_dt is highly overall correlated with base_month and 2 other fields | High correlation |
show_dt is highly overall correlated with base_month and 2 other fields | High correlation |
base_year is highly overall correlated with base_month and 2 other fields | High correlation |
base_year is highly imbalanced (80.6%) | Imbalance |
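The 80.6% imbalance figure for base_year can be reproduced from the class counts listed in its Common Values table (97 rows for 2017, 3 rows for 2020). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, divided by the maximum entropy for that number of classes. This formula matches the reported value, but it is an assumption and not something the report states; the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class has no entropy to normalize against
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts for base_year taken from the report: 2017 -> 97, 2020 -> 3.
score = imbalance_score([97, 3])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score of 1 a column with a single effective value.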
Reproduction
Analysis started | 2023-12-10 09:46:30.177750 |
---|---|
Analysis finished | 2023-12-10 09:46:35.640674 |
Duration | 5.46 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
base_year
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2017 | 97 |
---|---|
2020 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2017 |
---|---|
2nd row | 2020 |
3rd row | 2017 |
4th row | 2017 |
5th row | 2017 |
Common Values
Value | Count | Frequency (%) |
2017 | 97 | 97.0% |
2020 | 3 | 3.0% |
base_month
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.95 |
Q1 | 4 |
median | 5 |
Q3 | 6 |
95-th percentile | 7 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 2 |
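The quantile table can be re-derived from the value counts the report lists for base_month. The sketch below rebuilds the 100 observations from those counts and uses linear interpolation between order statistics (Python's `statistics.quantiles` with `method="inclusive"`). That interpolation rule is an assumption, chosen because it reproduces the 1.95 reported for the 5th percentile.

```python
import statistics

# Value counts for base_month as listed in the report (value -> count).
counts = {1: 5, 2: 2, 3: 17, 4: 18, 5: 30, 6: 18, 7: 10}
values = [v for v, c in counts.items() for _ in range(c)]

# n=20 yields 19 cut points at 5%, 10%, ..., 95%.
cuts = statistics.quantiles(values, n=20, method="inclusive")
p05, q1, median, q3, p95 = cuts[0], cuts[4], cuts[9], cuts[14], cuts[18]

print("5th percentile:", p05)
print("Q1, median, Q3:", q1, median, q3)
print("95th percentile:", p95)
print("IQR:", q3 - q1)
```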
Descriptive statistics
Standard deviation | 1.5109031 |
---|---|
Coefficient of variation (CV) | 0.3284572 |
Kurtosis | -0.11388064 |
Mean | 4.6 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.45542182 |
Sum | 460 |
Variance | 2.2828283 |
Monotonicity | Not monotonic |
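The descriptive statistics for base_month can be checked the same way. This sketch rebuilds the column from the reported value counts and recomputes the mean, variance, standard deviation, coefficient of variation and median absolute deviation. It assumes the report uses the sample variance (dividing by n - 1), which is what reproduces the 2.2828283 shown above.

```python
import math
import statistics

# Value counts for base_month as listed in the report (value -> count).
counts = {1: 5, 2: 2, 3: 17, 4: 18, 5: 30, 6: 18, 7: 10}
values = [v for v, c in counts.items() for _ in range(c)]

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
med = statistics.median(values)
mad = statistics.median(abs(v - med) for v in values)  # median absolute deviation

print(f"sum={sum(values)} mean={mean} variance={variance:.7f}")
print(f"std={std:.7f} cv={cv:.7f} mad={mad}")
```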
Value | Count | Frequency (%) |
5 | 30 | 30.0% |
4 | 18 | 18.0% |
6 | 18 | 18.0% |
3 | 17 | 17.0% |
7 | 10 | 10.0% |
1 | 5 | 5.0% |
2 | 2 | 2.0% |
Value | Count | Frequency (%) |
1 | 5 | 5.0% |
2 | 2 | 2.0% |
3 | 17 | 17.0% |
4 | 18 | 18.0% |
5 | 30 | 30.0% |
6 | 18 | 18.0% |
7 | 10 | 10.0% |
Value | Count | Frequency (%) |
7 | 10 | 10.0% |
6 | 18 | 18.0% |
5 | 30 | 30.0% |
4 | 18 | 18.0% |
3 | 17 | 17.0% |
2 | 2 | 2.0% |
1 | 5 | 5.0% |
base_day
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.93 |
Minimum | 1 |
---|---|
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 8 |
median | 20 |
Q3 | 27 |
95-th percentile | 30.05 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 19 |
Descriptive statistics
Standard deviation | 10.011362 |
---|---|
Coefficient of variation (CV) | 0.55835818 |
Kurtosis | -1.3666987 |
Mean | 17.93 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.28559762 |
Sum | 1793 |
Variance | 100.22737 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30 | 13 | 13.0% |
8 | 7 | 7.0% |
2 | 7 | 7.0% |
26 | 6 | 6.0% |
10 | 6 | 6.0% |
19 | 6 | 6.0% |
6 | 5 | 5.0% |
31 | 5 | 5.0% |
20 | 5 | 5.0% |
27 | 5 | 5.0% |
Other values (16) | 35 | 35.0% |
Value | Count | Frequency (%) |
1 | 3 | 3.0% |
2 | 7 | 7.0% |
3 | 2 | 2.0% |
5 | 2 | 2.0% |
6 | 5 | 5.0% |
7 | 2 | 2.0% |
8 | 7 | 7.0% |
10 | 6 | 6.0% |
12 | 1 | 1.0% |
13 | 1 | 1.0% |
Value | Count | Frequency (%) |
31 | 5 | 5.0% |
30 | 13 | 13.0% |
29 | 1 | 1.0% |
28 | 3 | 3.0% |
27 | 5 | 5.0% |
26 | 6 | 6.0% |
25 | 2 | 2.0% |
24 | 3 | 3.0% |
23 | 3 | 3.0% |
22 | 4 | 4.0% |
ticket_dt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 77 |
---|---|
Distinct (%) | 77.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20171357 |
Minimum | 20170119 |
---|---|
Maximum | 20200113 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20170119 |
---|---|
5-th percentile | 20170224 |
Q1 | 20170386 |
median | 20170504 |
Q3 | 20170610 |
95-th percentile | 20170719 |
Maximum | 20200113 |
Range | 29994 |
Interquartile range (IQR) | 224.75 |
Descriptive statistics
Standard deviation | 5084.341 |
---|---|
Coefficient of variation (CV) | 0.00025205746 |
Kurtosis | 29.846614 |
Mean | 20171357 |
Median Absolute Deviation (MAD) | 107.5 |
Skewness | 5.5877016 |
Sum | 2.0171357 × 10^9 |
Variance | 25850523 |
Monotonicity | Not monotonic |
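Because ticket_dt stores dates as YYYYMMDD integers, the numeric statistics above treat the step from 20171231 to 20180101 as 8870 when it is a single day, and interpolated quantiles such as 20170386 are not valid dates. A minimal sketch of converting the values to real dates before measuring spread, using only the minimum and maximum shown in the report; the helper name `parse_yyyymmdd` is hypothetical.

```python
from datetime import date, datetime

def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20170119 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

first = parse_yyyymmdd(20170119)  # reported minimum of ticket_dt
last = parse_yyyymmdd(20200113)   # reported maximum of ticket_dt

numeric_range = 20200113 - 20170119   # what the profile reports as "Range"
calendar_range = (last - first).days  # elapsed calendar days

print("numeric range:", numeric_range)
print("calendar range in days:", calendar_range)
```

Parsing the column as a date type before profiling would likely give more interpretable quantiles and dispersion figures for both ticket_dt and show_dt.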
Value | Count | Frequency (%) |
20170629 | 4 | 4.0% |
20170428 | 4 | 4.0% |
20170525 | 3 | 3.0% |
20170329 | 3 | 3.0% |
20170510 | 2 | 2.0% |
20170421 | 2 | 2.0% |
20170531 | 2 | 2.0% |
20170405 | 2 | 2.0% |
20170324 | 2 | 2.0% |
20170522 | 2 | 2.0% |
Other values (67) | 74 | 74.0% |
Value | Count | Frequency (%) |
20170119 | 1 | 1.0% |
20170120 | 1 | 1.0% |
20170214 | 1 | 1.0% |
20170220 | 1 | 1.0% |
20170221 | 1 | 1.0% |
20170224 | 1 | 1.0% |
20170227 | 1 | 1.0% |
20170228 | 1 | 1.0% |
20170302 | 1 | 1.0% |
20170310 | 1 | 1.0% |
Value | Count | Frequency (%) |
20200113 | 1 | 1.0% |
20200112 | 1 | 1.0% |
20200110 | 1 | 1.0% |
20170727 | 1 | 1.0% |
20170725 | 1 | 1.0% |
20170719 | 2 | 2.0% |
20170718 | 1 | 1.0% |
20170714 | 1 | 1.0% |
20170713 | 1 | 1.0% |
20170707 | 1 | 1.0% |
show_dt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 41 |
---|---|
Distinct (%) | 41.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20171378 |
Minimum | 20170119 |
---|---|
Maximum | 20200112 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20170119 |
---|---|
5-th percentile | 20170302 |
Q1 | 20170408 |
median | 20170506 |
Q3 | 20170618 |
95-th percentile | 20170727 |
Maximum | 20200112 |
Range | 29993 |
Interquartile range (IQR) | 210 |
Descriptive statistics
Standard deviation | 5080.3881 |
---|---|
Coefficient of variation (CV) | 0.00025186123 |
Kurtosis | 29.848516 |
Mean | 20171378 |
Median Absolute Deviation (MAD) | 104 |
Skewness | 5.5879574 |
Sum | 2.0171378 × 10^9 |
Variance | 25810343 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20170630 | 9 | 9.0% |
20170408 | 7 | 7.0% |
20170526 | 6 | 6.0% |
20170302 | 5 | 5.0% |
20170506 | 5 | 5.0% |
20170322 | 4 | 4.0% |
20170727 | 4 | 4.0% |
20170530 | 4 | 4.0% |
20170719 | 4 | 4.0% |
20170610 | 3 | 3.0% |
Other values (31) | 49 | 49.0% |
Value | Count | Frequency (%) |
20170119 | 1 | 1.0% |
20170120 | 1 | 1.0% |
20170214 | 1 | 1.0% |
20170224 | 1 | 1.0% |
20170302 | 5 | 5.0% |
20170310 | 1 | 1.0% |
20170321 | 1 | 1.0% |
20170322 | 4 | 4.0% |
20170323 | 3 | 3.0% |
20170324 | 1 | 1.0% |
Value | Count | Frequency (%) |
20200112 | 1 | 1.0% |
20200110 | 2 | 2.0% |
20170727 | 4 | 4.0% |
20170719 | 4 | 4.0% |
20170707 | 2 | 2.0% |
20170630 | 9 | 9.0% |
20170629 | 1 | 1.0% |
20170627 | 1 | 1.0% |
20170624 | 1 | 1.0% |
20170616 | 3 | 3.0% |
ticket_cnt
Real number (ℝ)
Distinct | 53 |
---|---|
Distinct (%) | 53.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.25 |
Minimum | 1 |
---|---|
Maximum | 320 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 10 |
median | 22.5 |
Q3 | 50 |
95-th percentile | 121.95 |
Maximum | 320 |
Range | 319 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 49.814885 |
---|---|
Coefficient of variation (CV) | 1.269169 |
Kurtosis | 11.797986 |
Mean | 39.25 |
Median Absolute Deviation (MAD) | 16.5 |
Skewness | 3.0086036 |
Sum | 3925 |
Variance | 2481.5227 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 9 | 9.0% |
30 | 6 | 6.0% |
50 | 5 | 5.0% |
3 | 4 | 4.0% |
20 | 4 | 4.0% |
2 | 4 | 4.0% |
14 | 3 | 3.0% |
7 | 3 | 3.0% |
40 | 3 | 3.0% |
1 | 3 | 3.0% |
Other values (43) | 56 | 56.0% |
Value | Count | Frequency (%) |
1 | 3 | 3.0% |
2 | 4 | 4.0% |
3 | 4 | 4.0% |
4 | 2 | 2.0% |
5 | 2 | 2.0% |
6 | 2 | 2.0% |
7 | 3 | 3.0% |
8 | 1 | 1.0% |
10 | 9 | 9.0% |
12 | 2 | 2.0% |
Value | Count | Frequency (%) |
320 | 1 | 1.0% |
221 | 1 | 1.0% |
202 | 1 | 1.0% |
167 | 1 | 1.0% |
140 | 1 | 1.0% |
121 | 1 | 1.0% |
108 | 1 | 1.0% |
100 | 1 | 1.0% |
99 | 1 | 1.0% |
97 | 1 | 1.0% |
Variable | base_year | base_month | base_day | ticket_dt | show_dt | ticket_cnt |
---|---|---|---|---|---|---|
base_year | 1.000 | 0.693 | 0.755 | 0.963 | 0.963 | 0.000 |
base_month | 0.693 | 1.000 | 0.741 | 0.785 | 0.785 | 0.217 |
base_day | 0.755 | 0.741 | 1.000 | 0.754 | 0.754 | 0.251 |
ticket_dt | 0.963 | 0.785 | 0.754 | 1.000 | 0.963 | 0.000 |
show_dt | 0.963 | 0.785 | 0.754 | 0.963 | 1.000 | 0.000 |
ticket_cnt | 0.000 | 0.217 | 0.251 | 0.000 | 0.000 | 1.000 |
Variable | base_month | base_day | ticket_dt | show_dt | ticket_cnt | base_year |
---|---|---|---|---|---|---|
base_month | 1.000 | 0.227 | 0.789 | 0.803 | -0.129 | 0.730 |
base_day | 0.227 | 1.000 | 0.324 | 0.344 | -0.059 | 0.397 |
ticket_dt | 0.789 | 0.324 | 1.000 | 0.979 | -0.145 | 0.826 |
show_dt | 0.803 | 0.344 | 0.979 | 1.000 | -0.172 | 0.826 |
ticket_cnt | -0.129 | -0.059 | -0.145 | -0.172 | 1.000 | 0.000 |
base_year | 0.730 | 0.397 | 0.826 | 0.826 | 0.000 | 1.000 |
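The high-correlation alerts can be cross-checked against the matrix directly above. The sketch below flags every off-diagonal pair whose absolute coefficient is at least 0.5. That cut-off is an assumption (the report does not state the threshold it used), though any value between roughly 0.4 and 0.73 gives the same grouping for this matrix.

```python
# Coefficients copied from the matrix above, in the same row and column order.
names = ["base_month", "base_day", "ticket_dt", "show_dt", "ticket_cnt", "base_year"]
matrix = [
    [1.000, 0.227, 0.789, 0.803, -0.129, 0.730],
    [0.227, 1.000, 0.324, 0.344, -0.059, 0.397],
    [0.789, 0.324, 1.000, 0.979, -0.145, 0.826],
    [0.803, 0.344, 0.979, 1.000, -0.172, 0.826],
    [-0.129, -0.059, -0.145, -0.172, 1.000, 0.000],
    [0.730, 0.397, 0.826, 0.826, 0.000, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off, not taken from the report

# For each variable, list the other variables it is strongly correlated with.
high = {
    name: [names[j] for j, r in enumerate(row) if j != i and abs(r) >= THRESHOLD]
    for i, (name, row) in enumerate(zip(names, matrix))
}

for name, partners in high.items():
    if partners:
        print(f"{name}: {', '.join(partners)}")
```

With these inputs base_month, ticket_dt, show_dt and base_year each have three partners, which matches the "and 2 other fields" wording of the alerts, while base_day and ticket_cnt have none.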
Index | base_year | base_month | base_day | ticket_dt | show_dt | ticket_cnt |
---|---|---|---|---|---|---|
0 | 2017 | 1 | 19 | 20170119 | 20170119 | 83 |
1 | 2020 | 1 | 10 | 20200110 | 20200110 | 40 |
2 | 2017 | 1 | 20 | 20170120 | 20170120 | 108 |
3 | 2017 | 2 | 14 | 20170214 | 20170214 | 7 |
4 | 2017 | 2 | 24 | 20170224 | 20170224 | 202 |
5 | 2017 | 3 | 2 | 20170220 | 20170302 | 7 |
6 | 2017 | 3 | 2 | 20170221 | 20170302 | 20 |
7 | 2020 | 1 | 10 | 20200113 | 20200110 | 24 |
8 | 2017 | 3 | 2 | 20170227 | 20170302 | 71 |
9 | 2017 | 3 | 2 | 20170228 | 20170302 | 5 |
Index | base_year | base_month | base_day | ticket_dt | show_dt | ticket_cnt |
---|---|---|---|---|---|---|
90 | 2017 | 7 | 7 | 20170629 | 20170707 | 8 |
91 | 2017 | 7 | 7 | 20170707 | 20170707 | 14 |
92 | 2017 | 7 | 19 | 20170713 | 20170719 | 50 |
93 | 2017 | 7 | 19 | 20170714 | 20170719 | 50 |
94 | 2017 | 7 | 19 | 20170718 | 20170719 | 43 |
95 | 2017 | 7 | 19 | 20170719 | 20170719 | 16 |
96 | 2017 | 7 | 27 | 20170629 | 20170727 | 10 |
97 | 2017 | 7 | 27 | 20170719 | 20170727 | 14 |
98 | 2017 | 7 | 27 | 20170725 | 20170727 | 10 |
99 | 2017 | 7 | 27 | 20170727 | 20170727 | 72 |
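The sample rows suggest that base_year, base_month and base_day are simply the components of show_dt, which would explain the high correlations among those fields. The sketch below checks this for a few of the rows above and computes the gap in days between ticket_dt and show_dt. Reading ticket_dt as the booking date and show_dt as the performance date is an assumption based on the column names; the helper `to_date` is hypothetical.

```python
from datetime import datetime

def to_date(value):
    """Convert a YYYYMMDD integer into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

# (base_year, base_month, base_day, ticket_dt, show_dt) copied from rows 90, 92, 96 and 99.
rows = [
    (2017, 7, 7, 20170629, 20170707),
    (2017, 7, 19, 20170713, 20170719),
    (2017, 7, 27, 20170629, 20170727),
    (2017, 7, 27, 20170727, 20170727),
]

lead_days = []
components_match = True
for year, month, day, ticket_dt, show_dt in rows:
    show = to_date(show_dt)
    # The base_* fields should equal the year, month and day of show_dt.
    components_match &= (show.year, show.month, show.day) == (year, month, day)
    lead_days.append((show - to_date(ticket_dt)).days)

print("base_* fields match show_dt:", components_match)
print("lead time in days:", lead_days)
```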