Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 5 |
Dataset
Description | Sample |
---|---|
Author | 국민체육진흥공단 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=e558a1fd-1fe9-4f40-8f10-4badca253e6d |
tms is highly overall correlated with race_day | High correlation |
race_day is highly overall correlated with tms and 1 other fields | High correlation |
tak is highly overall correlated with starting and 1 other fields | High correlation |
starting is highly overall correlated with tak | High correlation |
eclnt is highly overall correlated with tak | High correlation |
stnd_year is highly overall correlated with race_day | High correlation |
stnd_year is highly imbalanced (80.6%) | Imbalance |
repr is highly imbalanced (52.3%) | Imbalance |
race_day has unique values | Unique |
tak has 4 (4.0%) zeros | Zeros |
starting has 9 (9.0%) zeros | Zeros |
eclnt has 9 (9.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 10:00:15.419189 |
---|---|
Analysis finished | 2023-12-10 10:00:21.460938 |
Duration | 6.04 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
stnd_year
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019 | |
---|---|
2021 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2021 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 97 | |
2021 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019 | 97 | |
2021 | 3 | 3.0% |
tms
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 34 |
---|---|
Distinct (%) | 34.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.16 |
Minimum | 10 |
---|---|
Maximum | 51 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 11 |
Q1 | 17 |
median | 34.5 |
Q3 | 43 |
95-th percentile | 50 |
Maximum | 51 |
Range | 41 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 13.652039 |
---|---|
Coefficient of variation (CV) | 0.43812707 |
Kurtosis | -1.4564971 |
Mean | 31.16 |
Median Absolute Deviation (MAD) | 12.5 |
Skewness | -0.21943898 |
Sum | 3116 |
Variance | 186.37818 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 5 | 5.0% |
12 | 4 | 4.0% |
43 | 3 | 3.0% |
19 | 3 | 3.0% |
16 | 3 | 3.0% |
15 | 3 | 3.0% |
14 | 3 | 3.0% |
13 | 3 | 3.0% |
10 | 3 | 3.0% |
40 | 3 | 3.0% |
Other values (24) | 67 |
Value | Count | Frequency (%) |
10 | 3 | |
11 | 5 | |
12 | 4 | |
13 | 3 | |
14 | 3 | |
15 | 3 | |
16 | 3 | |
17 | 3 | |
18 | 3 | |
19 | 3 |
Value | Count | Frequency (%) |
51 | 3 | |
50 | 3 | |
49 | 3 | |
48 | 3 | |
47 | 3 | |
46 | 3 | |
45 | 2 | |
44 | 3 | |
43 | 3 | |
42 | 3 |
day_ord
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1 | |
---|---|
3 | |
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 3 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 35 | |
3 | 34 | |
2 | 31 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 35 | |
3 | 34 | |
2 | 31 |
race_day
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20191386 |
Minimum | 20190308 |
---|---|
Maximum | 20210319 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190308 |
---|---|
5-th percentile | 20190317 |
Q1 | 20190504 |
median | 20190904 |
Q3 | 20191108 |
95-th percentile | 20191227 |
Maximum | 20210319 |
Range | 20011 |
Interquartile range (IQR) | 604.5 |
Descriptive statistics
Standard deviation | 3360.1838 |
---|---|
Coefficient of variation (CV) | 0.0001664167 |
Kurtosis | 29.334365 |
Mean | 20191386 |
Median Absolute Deviation (MAD) | 262 |
Skewness | 5.5177837 |
Sum | 2.0191386 × 109 |
Variance | 11290835 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190726 | 1 | 1.0% |
20191019 | 1 | 1.0% |
20190324 | 1 | 1.0% |
20190323 | 1 | 1.0% |
20190322 | 1 | 1.0% |
20190317 | 1 | 1.0% |
20190316 | 1 | 1.0% |
20190315 | 1 | 1.0% |
20190310 | 1 | 1.0% |
20190309 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
20190308 | 1 | |
20190309 | 1 | |
20190310 | 1 | |
20190315 | 1 | |
20190316 | 1 | |
20190317 | 1 | |
20190322 | 1 | |
20190323 | 1 | |
20190324 | 1 | |
20190329 | 1 |
Value | Count | Frequency (%) |
20210319 | 1 | |
20210314 | 1 | |
20210313 | 1 | |
20191229 | 1 | |
20191228 | 1 | |
20191227 | 1 | |
20191222 | 1 | |
20191221 | 1 | |
20191220 | 1 | |
20191215 | 1 |
tak
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 18 |
---|---|
Distinct (%) | 18.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.22 |
Minimum | 0 |
---|---|
Maximum | 17 |
Zeros | 4 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 4.75 |
median | 7 |
Q3 | 10 |
95-th percentile | 13.05 |
Maximum | 17 |
Range | 17 |
Interquartile range (IQR) | 5.25 |
Descriptive statistics
Standard deviation | 3.9043617 |
---|---|
Coefficient of variation (CV) | 0.54077032 |
Kurtosis | -0.52451253 |
Mean | 7.22 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.10128905 |
Sum | 722 |
Variance | 15.24404 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 14 | |
9 | 13 | |
10 | 12 | |
2 | 10 | |
8 | 7 | 7.0% |
5 | 7 | 7.0% |
3 | 5 | 5.0% |
13 | 5 | 5.0% |
7 | 5 | 5.0% |
12 | 4 | 4.0% |
Other values (8) | 18 |
Value | Count | Frequency (%) |
0 | 4 | 4.0% |
1 | 2 | 2.0% |
2 | 10 | |
3 | 5 | 5.0% |
4 | 4 | 4.0% |
5 | 7 | |
6 | 14 | |
7 | 5 | 5.0% |
8 | 7 | |
9 | 13 |
Value | Count | Frequency (%) |
17 | 1 | 1.0% |
16 | 1 | 1.0% |
15 | 1 | 1.0% |
14 | 2 | 2.0% |
13 | 5 | 5.0% |
12 | 4 | 4.0% |
11 | 3 | 3.0% |
10 | 12 | |
9 | 13 | |
8 | 7 |
rora
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | |
---|---|
1 | |
2 | |
3 | 3 |
4 | 2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 1 |
3rd row | 2 |
4th row | 0 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
0 | 60 | |
1 | 25 | |
2 | 10 | 10.0% |
3 | 3 | 3.0% |
4 | 2 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 60 | |
1 | 25 | |
2 | 10 | 10.0% |
3 | 3 | 3.0% |
4 | 2 | 2.0% |
repr
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | |
---|---|
1 | |
2 | 6 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 85 | |
1 | 9 | 9.0% |
2 | 6 | 6.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 85 | |
1 | 9 | 9.0% |
2 | 6 | 6.0% |
starting
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.83 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9 |
Zeros (%) | 9.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 7 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.2920812 |
---|---|
Coefficient of variation (CV) | 0.59845463 |
Kurtosis | -0.96278276 |
Mean | 3.83 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.0074873425 |
Sum | 383 |
Variance | 5.2536364 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 18 | |
4 | 16 | |
5 | 14 | |
6 | 12 | |
7 | 12 | |
0 | 9 | |
3 | 8 | |
1 | 8 | |
8 | 2 | 2.0% |
9 | 1 | 1.0% |
Value | Count | Frequency (%) |
0 | 9 | |
1 | 8 | |
2 | 18 | |
3 | 8 | |
4 | 16 | |
5 | 14 | |
6 | 12 | |
7 | 12 | |
8 | 2 | 2.0% |
9 | 1 | 1.0% |
Value | Count | Frequency (%) |
9 | 1 | 1.0% |
8 | 2 | 2.0% |
7 | 12 | |
6 | 12 | |
5 | 14 | |
4 | 16 | |
3 | 8 | |
2 | 18 | |
1 | 8 | |
0 | 9 |
eclnt
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.4 |
Minimum | 0 |
---|---|
Maximum | 8 |
Zeros | 9 |
Zeros (%) | 9.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 3 |
Q3 | 5 |
95-th percentile | 8 |
Maximum | 8 |
Range | 8 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.1177461 |
---|---|
Coefficient of variation (CV) | 0.6228665 |
Kurtosis | -0.34396676 |
Mean | 3.4 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.37635367 |
Sum | 340 |
Variance | 4.4848485 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 18 | |
4 | 18 | |
2 | 17 | |
5 | 13 | |
1 | 10 | |
0 | 9 | |
8 | 6 | 6.0% |
6 | 6 | 6.0% |
7 | 3 | 3.0% |
Value | Count | Frequency (%) |
0 | 9 | |
1 | 10 | |
2 | 17 | |
3 | 18 | |
4 | 18 | |
5 | 13 | |
6 | 6 | 6.0% |
7 | 3 | 3.0% |
8 | 6 | 6.0% |
Value | Count | Frequency (%) |
8 | 6 | 6.0% |
7 | 3 | 3.0% |
6 | 6 | 6.0% |
5 | 13 | |
4 | 18 | |
3 | 18 | |
2 | 17 | |
1 | 10 | |
0 | 9 |
get_eclet
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | |
---|---|
1 | |
2 | |
5 | 1 |
4 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 2 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
0 | 48 | |
1 | 28 | |
2 | 22 | |
5 | 1 | 1.0% |
4 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 48 | |
1 | 28 | |
2 | 22 | |
5 | 1 | 1.0% |
4 | 1 | 1.0% |
stnd_year | tms | day_ord | race_day | tak | rora | repr | starting | eclnt | get_eclet | |
---|---|---|---|---|---|---|---|---|---|---|
stnd_year | 1.000 | 0.258 | 0.000 | 0.963 | 0.000 | 0.000 | 0.091 | 0.000 | 0.260 | 0.000 |
tms | 0.258 | 1.000 | 0.000 | 0.276 | 0.209 | 0.000 | 0.203 | 0.238 | 0.081 | 0.283 |
day_ord | 0.000 | 0.000 | 1.000 | 0.000 | 0.543 | 0.000 | 0.000 | 0.426 | 0.483 | 0.407 |
race_day | 0.963 | 0.276 | 0.000 | 1.000 | 0.000 | 0.000 | 0.091 | 0.000 | 0.258 | 0.000 |
tak | 0.000 | 0.209 | 0.543 | 0.000 | 1.000 | 0.000 | 0.000 | 0.833 | 0.604 | 0.432 |
rora | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.324 | 0.000 | 0.149 | 0.000 |
repr | 0.091 | 0.203 | 0.000 | 0.091 | 0.000 | 0.324 | 1.000 | 0.218 | 0.000 | 0.260 |
starting | 0.000 | 0.238 | 0.426 | 0.000 | 0.833 | 0.000 | 0.218 | 1.000 | 0.262 | 0.000 |
eclnt | 0.260 | 0.081 | 0.483 | 0.258 | 0.604 | 0.149 | 0.000 | 0.262 | 1.000 | 0.076 |
get_eclet | 0.000 | 0.283 | 0.407 | 0.000 | 0.432 | 0.000 | 0.260 | 0.000 | 0.076 | 1.000 |
stnd_year | rora | day_ord | repr | get_eclet | |
---|---|---|---|---|---|
stnd_year | 1.000 | 0.000 | 0.000 | 0.150 | 0.000 |
rora | 0.000 | 1.000 | 0.000 | 0.256 | 0.000 |
day_ord | 0.000 | 0.000 | 1.000 | 0.000 | 0.333 |
repr | 0.150 | 0.256 | 0.000 | 1.000 | 0.200 |
get_eclet | 0.000 | 0.000 | 0.333 | 0.200 | 1.000 |
tms | race_day | tak | starting | eclnt | stnd_year | day_ord | rora | repr | get_eclet | |
---|---|---|---|---|---|---|---|---|---|---|
tms | 1.000 | 0.844 | -0.245 | -0.371 | 0.082 | 0.246 | 0.000 | 0.000 | 0.083 | 0.161 |
race_day | 0.844 | 1.000 | -0.383 | -0.457 | -0.062 | 0.826 | 0.000 | 0.000 | 0.150 | 0.000 |
tak | -0.245 | -0.383 | 1.000 | 0.790 | 0.685 | 0.000 | 0.371 | 0.000 | 0.000 | 0.186 |
starting | -0.371 | -0.457 | 0.790 | 1.000 | 0.348 | 0.000 | 0.271 | 0.000 | 0.124 | 0.000 |
eclnt | 0.082 | -0.062 | 0.685 | 0.348 | 1.000 | 0.249 | 0.234 | 0.079 | 0.000 | 0.031 |
stnd_year | 0.246 | 0.826 | 0.000 | 0.000 | 0.249 | 1.000 | 0.000 | 0.000 | 0.150 | 0.000 |
day_ord | 0.000 | 0.000 | 0.371 | 0.271 | 0.234 | 0.000 | 1.000 | 0.000 | 0.000 | 0.333 |
rora | 0.000 | 0.000 | 0.000 | 0.000 | 0.079 | 0.000 | 0.000 | 1.000 | 0.256 | 0.000 |
repr | 0.083 | 0.150 | 0.000 | 0.124 | 0.000 | 0.150 | 0.000 | 0.256 | 1.000 | 0.200 |
get_eclet | 0.161 | 0.000 | 0.186 | 0.000 | 0.031 | 0.000 | 0.333 | 0.000 | 0.200 | 1.000 |
stnd_year | tms | day_ord | race_day | tak | rora | repr | starting | eclnt | get_eclet | |
---|---|---|---|---|---|---|---|---|---|---|
0 | 2019 | 30 | 1 | 20190726 | 9 | 0 | 0 | 4 | 3 | 2 |
1 | 2021 | 12 | 1 | 20210319 | 5 | 1 | 0 | 4 | 2 | 0 |
2 | 2019 | 30 | 3 | 20190728 | 3 | 2 | 0 | 2 | 3 | 0 |
3 | 2019 | 31 | 1 | 20190802 | 11 | 0 | 0 | 3 | 8 | 0 |
4 | 2019 | 31 | 2 | 20190803 | 9 | 1 | 0 | 6 | 3 | 1 |
5 | 2019 | 31 | 3 | 20190804 | 6 | 0 | 0 | 4 | 2 | 0 |
6 | 2019 | 32 | 1 | 20190809 | 13 | 0 | 0 | 4 | 8 | 1 |
7 | 2021 | 11 | 3 | 20210314 | 2 | 0 | 2 | 2 | 0 | 0 |
8 | 2019 | 32 | 3 | 20190811 | 6 | 1 | 0 | 3 | 4 | 0 |
9 | 2019 | 33 | 1 | 20190816 | 10 | 1 | 0 | 5 | 6 | 1 |
stnd_year | tms | day_ord | race_day | tak | rora | repr | starting | eclnt | get_eclet | |
---|---|---|---|---|---|---|---|---|---|---|
90 | 2019 | 18 | 1 | 20190503 | 9 | 0 | 0 | 7 | 2 | 0 |
91 | 2019 | 18 | 2 | 20190504 | 10 | 0 | 0 | 6 | 3 | 1 |
92 | 2019 | 18 | 3 | 20190505 | 6 | 0 | 0 | 5 | 1 | 0 |
93 | 2019 | 19 | 1 | 20190510 | 10 | 0 | 1 | 2 | 4 | 5 |
94 | 2019 | 19 | 2 | 20190511 | 9 | 0 | 0 | 2 | 3 | 4 |
95 | 2019 | 19 | 3 | 20190512 | 0 | 0 | 0 | 0 | 0 | 0 |
96 | 2019 | 20 | 1 | 20190517 | 12 | 2 | 1 | 8 | 6 | 1 |
97 | 2019 | 20 | 2 | 20190518 | 10 | 0 | 2 | 7 | 4 | 1 |
98 | 2019 | 20 | 3 | 20190519 | 8 | 0 | 0 | 6 | 2 | 0 |
99 | 2019 | 21 | 1 | 20190524 | 8 | 1 | 0 | 5 | 4 | 0 |