Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 76 |
Missing cells (%) | 15.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.2 KiB |
Average record size in memory | 43.3 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 1 |
Text | 2 |
Dataset
Description | Sample |
---|---|
Author | 국민체육진흥공단 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=c17379be-4738-432a-afbe-c0e3aadd2bc8 |
stnd_year is highly overall correlated with heal_stat | High correlation |
heal_stat is highly overall correlated with tms and 1 other fields | High correlation |
tms is highly overall correlated with heal_stat | High correlation |
stnd_year is highly imbalanced (80.6%) | Imbalance |
trng_stat has 76 (76.0%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 10:07:13.119412 |
---|---|
Analysis finished | 2023-12-10 10:07:14.174697 |
Duration | 1.06 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
stnd_year
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019 | |
---|---|
2021 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2021 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 97 | |
2021 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019 | 97 | |
2021 | 3 | 3.0% |
tms
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.21 |
Minimum | 10 |
---|---|
Maximum | 32 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 18 |
Q1 | 28 |
median | 30 |
Q3 | 30 |
95-th percentile | 30 |
Maximum | 32 |
Range | 22 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 3.8357568 |
---|---|
Coefficient of variation (CV) | 0.13597153 |
Kurtosis | 7.6981488 |
Mean | 28.21 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -2.7971704 |
Sum | 2821 |
Variance | 14.71303 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30 | 55 | |
28 | 33 | |
18 | 7 | 7.0% |
32 | 3 | 3.0% |
15 | 1 | 1.0% |
10 | 1 | 1.0% |
Value | Count | Frequency (%) |
10 | 1 | 1.0% |
15 | 1 | 1.0% |
18 | 7 | 7.0% |
28 | 33 | |
30 | 55 | |
32 | 3 | 3.0% |
Value | Count | Frequency (%) |
32 | 3 | 3.0% |
30 | 55 | |
28 | 33 | |
18 | 7 | 7.0% |
15 | 1 | 1.0% |
10 | 1 | 1.0% |
racer_no
Text
Distinct | 79 |
---|---|
Distinct (%) | 79.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
04-007 | 2 | 2.0% |
03-004 | 2 | 2.0% |
02-009 | 2 | 2.0% |
02-016 | 2 | 2.0% |
14-003 | 2 | 2.0% |
01-035 | 2 | 2.0% |
02-031 | 2 | 2.0% |
02-015 | 2 | 2.0% |
15-008 | 2 | 2.0% |
01-030 | 2 | 2.0% |
Other values (69) | 80 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 233 | |
- | 100 | |
1 | 93 | 15.5% |
2 | 40 | 6.7% |
3 | 29 | 4.8% |
4 | 24 | 4.0% |
5 | 24 | 4.0% |
7 | 18 | 3.0% |
9 | 16 | 2.7% |
6 | 12 | 2.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 500 | |
Dash Punctuation | 100 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 233 | |
1 | 93 | 18.6% |
2 | 40 | 8.0% |
3 | 29 | 5.8% |
4 | 24 | 4.8% |
5 | 24 | 4.8% |
7 | 18 | 3.6% |
9 | 16 | 3.2% |
6 | 12 | 2.4% |
8 | 11 | 2.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 600 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 233 | |
- | 100 | |
1 | 93 | 15.5% |
2 | 40 | 6.7% |
3 | 29 | 4.8% |
4 | 24 | 4.0% |
5 | 24 | 4.0% |
7 | 18 | 3.0% |
9 | 16 | 2.7% |
6 | 12 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 600 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 233 | |
- | 100 | |
1 | 93 | 15.5% |
2 | 40 | 6.7% |
3 | 29 | 4.8% |
4 | 24 | 4.0% |
5 | 24 | 4.0% |
7 | 18 | 3.0% |
9 | 16 | 2.7% |
6 | 12 | 2.0% |
heal_stat
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
양호 | |
---|---|
<NA> |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.66 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 양호 |
---|---|
2nd row | 양호 |
3rd row | 양호 |
4th row | 양호 |
5th row | 양호 |
Common Values
Value | Count | Frequency (%) |
양호 | 67 | |
<NA> | 33 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
양호 | 67 | |
na | 33 |
trng_stat
Text
MISSING
 
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 76 |
Missing (%) | 76.0% |
Memory size | 932.0 B |
Length
Max length | 26 |
---|---|
Median length | 23 |
Mean length | 18.75 |
Min length | 7 |
Characters and Unicode
Total characters | 450 |
---|---|
Distinct characters | 22 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 24 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 스타트 16회 |
---|---|
2nd row | 개인선회 10회, 편대선회 5회, 스타트 10회 |
3rd row | 개인선회 5회, 스타트 24회, 편대 5회 |
4th row | 개인선회 10회, 스타트 10회, 편대 5회 |
5th row | 개인선회 7회. 스타트 7회. |
Value | Count | Frequency (%) |
개인선회 | 19 | |
스타트 | 18 | |
5회 | 9 | 8.5% |
편대 | 8 | 7.5% |
10회 | 8 | 7.5% |
8회 | 5 | 4.7% |
11회 | 3 | 2.8% |
2회 | 3 | 2.8% |
1회 | 2 | 1.9% |
20회 | 2 | 1.9% |
Other values (23) | 29 |
Most occurring characters
Value | Count | Frequency (%) |
84 | ||
회 | 81 | |
, | 28 | 6.2% |
선 | 25 | 5.6% |
스 | 23 | 5.1% |
타 | 23 | 5.1% |
트 | 23 | 5.1% |
개 | 22 | 4.9% |
인 | 22 | 4.9% |
1 | 21 | 4.7% |
Other values (12) | 98 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 247 | |
Decimal Number | 87 | 19.3% |
Space Separator | 84 | 18.7% |
Other Punctuation | 32 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 21 | |
0 | 14 | |
5 | 13 | |
2 | 13 | |
4 | 7 | 8.0% |
3 | 7 | 8.0% |
8 | 6 | 6.9% |
6 | 3 | 3.4% |
7 | 2 | 2.3% |
9 | 1 | 1.1% |
Other Letter
Value | Count | Frequency (%) |
회 | 81 | |
선 | 25 | 10.1% |
스 | 23 | 9.3% |
타 | 23 | 9.3% |
트 | 23 | 9.3% |
개 | 22 | 8.9% |
인 | 22 | 8.9% |
편 | 14 | 5.7% |
대 | 14 | 5.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 28 | |
. | 4 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
84 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 247 | |
Common | 203 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
84 | ||
, | 28 | 13.8% |
1 | 21 | 10.3% |
0 | 14 | 6.9% |
5 | 13 | 6.4% |
2 | 13 | 6.4% |
4 | 7 | 3.4% |
3 | 7 | 3.4% |
8 | 6 | 3.0% |
. | 4 | 2.0% |
Other values (3) | 6 | 3.0% |
Hangul
Value | Count | Frequency (%) |
회 | 81 | |
선 | 25 | 10.1% |
스 | 23 | 9.3% |
타 | 23 | 9.3% |
트 | 23 | 9.3% |
개 | 22 | 8.9% |
인 | 22 | 8.9% |
편 | 14 | 5.7% |
대 | 14 | 5.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 247 | |
ASCII | 203 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
84 | ||
, | 28 | 13.8% |
1 | 21 | 10.3% |
0 | 14 | 6.9% |
5 | 13 | 6.4% |
2 | 13 | 6.4% |
4 | 7 | 3.4% |
3 | 7 | 3.4% |
8 | 6 | 3.0% |
. | 4 | 2.0% |
Other values (3) | 6 | 3.0% |
Hangul
Value | Count | Frequency (%) |
회 | 81 | |
선 | 25 | 10.1% |
스 | 23 | 9.3% |
타 | 23 | 9.3% |
트 | 23 | 9.3% |
개 | 22 | 8.9% |
인 | 22 | 8.9% |
편 | 14 | 5.7% |
대 | 14 | 5.7% |
stnd_year | tms | racer_no | trng_stat | |
---|---|---|---|---|
stnd_year | 1.000 | 0.000 | 0.000 | NaN |
tms | 0.000 | 1.000 | 0.000 | 1.000 |
racer_no | 0.000 | 0.000 | 1.000 | 1.000 |
trng_stat | NaN | 1.000 | 1.000 | 1.000 |
stnd_year | heal_stat | |
---|---|---|
stnd_year | 1.000 | 1.000 |
heal_stat | 1.000 | 1.000 |
tms | stnd_year | heal_stat | |
---|---|---|---|
tms | 1.000 | 0.000 | 1.000 |
stnd_year | 0.000 | 1.000 | 1.000 |
heal_stat | 1.000 | 1.000 | 1.000 |
stnd_year | tms | racer_no | heal_stat | trng_stat | |
---|---|---|---|---|---|
0 | 2019 | 15 | 07-011 | 양호 | 스타트 16회 |
1 | 2021 | 32 | 02-015 | 양호 | <NA> |
2 | 2019 | 30 | 14-011 | 양호 | <NA> |
3 | 2019 | 30 | 14-008 | 양호 | <NA> |
4 | 2019 | 30 | 11-007 | 양호 | <NA> |
5 | 2019 | 30 | 11-006 | 양호 | <NA> |
6 | 2019 | 10 | 03-004 | 양호 | <NA> |
7 | 2021 | 32 | 02-008 | 양호 | <NA> |
8 | 2019 | 30 | 09-002 | 양호 | <NA> |
9 | 2019 | 30 | 13-001 | 양호 | 개인선회 10회, 편대선회 5회, 스타트 10회 |
stnd_year | tms | racer_no | heal_stat | trng_stat | |
---|---|---|---|---|---|
90 | 2019 | 28 | 01-030 | <NA> | <NA> |
91 | 2019 | 28 | 01-019 | <NA> | <NA> |
92 | 2019 | 28 | 01-002 | <NA> | <NA> |
93 | 2019 | 18 | 12-009 | 양호 | 개인선회5회, 스타트 30회 , 편대 1회 |
94 | 2019 | 18 | 04-006 | 양호 | 개인선회 3회,스타트 20회 |
95 | 2019 | 18 | 01-047 | 양호 | 개인선회 2회, 스타트 5회 |
96 | 2019 | 18 | 02-010 | 양호 | 개인선회 4회,스타트32회. |
97 | 2019 | 18 | 09-002 | 양호 | 개인선회 4회. 스타트 16회 |
98 | 2019 | 18 | 15-013 | 양호 | 스타트 45회 |
99 | 2019 | 18 | 14-003 | 양호 | <NA> |