Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 3 |
Duplicate rows (%) | 3.0% |
Total size in memory | 4.2 KiB |
Average record size in memory | 43.3 B |
Variable types
DateTime | 2 |
---|---|
Numeric | 1 |
Categorical | 2 |
Dataset
Description | Sample |
---|---|
Author | 국민체육진흥공단 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=5aabaf89-534f-4c6c-b7bf-1849969ffc89 |
Dataset has 3 (3.0%) duplicate rows | Duplicates |
state_nm is highly overall correlated with state_cd | High correlation |
state_cd is highly overall correlated with state_nm | High correlation |
state_cd is highly imbalanced (63.4%) | Imbalance |
state_nm is highly imbalanced (63.4%) | Imbalance |
Reproduction
Analysis started | 2023-12-10 10:12:31.267844 |
---|---|
Analysis finished | 2023-12-10 10:12:32.071487 |
Duration | 0.8 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
watch_date
Date
Distinct | 55 |
---|---|
Distinct (%) | 55.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2015-01-12 00:00:00 |
---|---|
Maximum | 2019-11-21 00:00:00 |
tour_time_nm
Date
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2023-12-10 09:00:00 |
---|---|
Maximum | 2023-12-10 11:00:00 |
watch_member_no
Real number (ℝ)
Distinct | 38 |
---|---|
Distinct (%) | 38.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49.87 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 17.9 |
Q1 | 24 |
median | 40 |
Q3 | 72.25 |
95-th percentile | 100 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 48.25 |
Descriptive statistics
Standard deviation | 31.123298 |
---|---|
Coefficient of variation (CV) | 0.6240886 |
Kurtosis | -1.1256649 |
Mean | 49.87 |
Median Absolute Deviation (MAD) | 19.5 |
Skewness | 0.57054187 |
Sum | 4987 |
Variance | 968.6597 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100 | 20 | |
20 | 10 | 10.0% |
24 | 7 | 7.0% |
30 | 5 | 5.0% |
50 | 5 | 5.0% |
70 | 4 | 4.0% |
25 | 4 | 4.0% |
27 | 3 | 3.0% |
31 | 3 | 3.0% |
23 | 3 | 3.0% |
Other values (28) | 36 |
Value | Count | Frequency (%) |
1 | 2 | 2.0% |
11 | 1 | 1.0% |
14 | 1 | 1.0% |
16 | 1 | 1.0% |
18 | 1 | 1.0% |
20 | 10 | |
21 | 1 | 1.0% |
22 | 1 | 1.0% |
23 | 3 | 3.0% |
24 | 7 |
Value | Count | Frequency (%) |
100 | 20 | |
95 | 1 | 1.0% |
90 | 1 | 1.0% |
80 | 1 | 1.0% |
75 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
70 | 4 | 4.0% |
65 | 2 | 2.0% |
60 | 2 | 2.0% |
state_cd
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2 | |
---|---|
0 | 7 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 93 | |
0 | 7 | 7.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 93 | |
0 | 7 | 7.0% |
state_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
확정 | |
---|---|
예약신청 | 7 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.14 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 예약신청 |
---|---|
2nd row | 예약신청 |
3rd row | 확정 |
4th row | 확정 |
5th row | 확정 |
Common Values
Value | Count | Frequency (%) |
확정 | 93 | |
예약신청 | 7 | 7.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
확정 | 93 | |
예약신청 | 7 | 7.0% |
watch_date | tour_time_nm | watch_member_no | state_cd | state_nm | |
---|---|---|---|---|---|
watch_date | 1.000 | 0.827 | 0.792 | 0.777 | 0.777 |
tour_time_nm | 0.827 | 1.000 | 0.651 | 0.000 | 0.000 |
watch_member_no | 0.792 | 0.651 | 1.000 | 0.307 | 0.307 |
state_cd | 0.777 | 0.000 | 0.307 | 1.000 | 0.993 |
state_nm | 0.777 | 0.000 | 0.307 | 0.993 | 1.000 |
state_nm | state_cd | |
---|---|---|
state_nm | 1.000 | 0.922 |
state_cd | 0.922 | 1.000 |
watch_member_no | state_cd | state_nm | |
---|---|---|---|
watch_member_no | 1.000 | 0.224 | 0.224 |
state_cd | 0.224 | 1.000 | 0.922 |
state_nm | 0.224 | 0.922 | 1.000 |
watch_date | tour_time_nm | watch_member_no | state_cd | state_nm | |
---|---|---|---|---|---|
0 | 2015-01-12 | 10:00 | 20 | 0 | 예약신청 |
1 | 2019-11-14 | 10:00 | 31 | 0 | 예약신청 |
2 | 2015-11-12 | 10:00 | 26 | 2 | 확정 |
3 | 2015-11-26 | 11:00 | 14 | 2 | 확정 |
4 | 2016-04-06 | 11:00 | 24 | 2 | 확정 |
5 | 2016-04-06 | 10:00 | 42 | 2 | 확정 |
6 | 2016-04-06 | 10:00 | 58 | 2 | 확정 |
7 | 2019-11-14 | 10:00 | 31 | 0 | 예약신청 |
8 | 2016-04-07 | 10:00 | 51 | 2 | 확정 |
9 | 2016-04-07 | 11:00 | 60 | 2 | 확정 |
watch_date | tour_time_nm | watch_member_no | state_cd | state_nm | |
---|---|---|---|---|---|
90 | 2017-06-21 | 11:00 | 20 | 2 | 확정 |
91 | 2017-06-21 | 11:00 | 18 | 0 | 예약신청 |
92 | 2017-06-22 | 10:00 | 20 | 2 | 확정 |
93 | 2017-06-29 | 11:00 | 43 | 2 | 확정 |
94 | 2017-09-06 | 11:00 | 31 | 2 | 확정 |
95 | 2017-09-13 | 11:00 | 100 | 2 | 확정 |
96 | 2017-09-14 | 11:00 | 49 | 2 | 확정 |
97 | 2017-09-14 | 10:00 | 40 | 2 | 확정 |
98 | 2017-09-20 | 10:00 | 100 | 2 | 확정 |
99 | 2017-09-21 | 10:00 | 23 | 2 | 확정 |
Most frequently occurring
watch_date | tour_time_nm | watch_member_no | state_cd | state_nm | # duplicates | |
---|---|---|---|---|---|---|
0 | 2016-05-04 | 10:00 | 24 | 2 | 확정 | 2 |
1 | 2016-09-28 | 11:00 | 30 | 2 | 확정 | 2 |
2 | 2019-11-14 | 10:00 | 31 | 0 | 예약신청 | 2 |