Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 101 |
Missing cells (%) | 10.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.3 KiB |
Average record size in memory | 85.3 B |
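The missing-cell figures above are plain arithmetic: the number of missing cells divided by variables times observations. A minimal sketch, using the per-variable missing counts reported further down in this profile (100 for zon_nm, 1 for prtc_tot_grde). The helper name `missing_cell_share` is hypothetical and not part of ydata-profiling.

```python
def missing_cell_share(missing_per_variable, n_variables, n_observations):
    """Return (missing cells, share of all cells that are missing)."""
    missing = sum(missing_per_variable.values())
    return missing, missing / (n_variables * n_observations)


# Per-variable missing counts as reported in this profile;
# variables not listed here have no missing values.
missing_counts = {"zon_nm": 100, "prtc_tot_grde": 1}

cells, share = missing_cell_share(missing_counts, n_variables=10, n_observations=100)
print(cells, f"{share:.1%}")
```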
Variable types
Categorical | 5 |
---|---|
Text | 1 |
Numeric | 2 |
Boolean | 1 |
Unsupported | 1 |
Dataset
Description | Sample |
---|---|
Author | 국민체육진흥공단 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=b05c6e3f-a67e-4aaa-b057-68781475be2d |
prtc_tot_grde is highly overall correlated with prtc_pas_div_nm and 1 other fields | High correlation |
orst_tot_grde is highly overall correlated with prtc_pas_div_nm and 1 other fields | High correlation |
efc_yy is highly overall correlated with qf_grade_nm and 2 other fields | High correlation |
qf_grade_nm is highly overall correlated with efc_yy and 2 other fields | High correlation |
cour_nm is highly overall correlated with efc_yy and 2 other fields | High correlation |
prtc_pas_div_nm is highly overall correlated with prtc_tot_grde and 3 other fields | High correlation |
fnl_pas_yn is highly overall correlated with prtc_tot_grde and 3 other fields | High correlation |
qf_itm_nm is highly overall correlated with efc_yy and 4 other fields | High correlation |
efc_yy is highly imbalanced (80.6%) | Imbalance |
qf_grade_nm is highly imbalanced (80.6%) | Imbalance |
cour_nm is highly imbalanced (80.6%) | Imbalance |
zon_nm has 100 (100.0%) missing values | Missing |
usr_no has unique values | Unique |
zon_nm is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
prtc_tot_grde has 7 (7.0%) zeros | Zeros |
orst_tot_grde has 5 (5.0%) zeros | Zeros |
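The 80.6% imbalance reported for efc_yy, qf_grade_nm and cour_nm is consistent with a score defined as one minus the normalized Shannon entropy of the class counts (97 versus 3). The sketch below reproduces that arithmetic; that ydata-profiling computes the alert exactly this way is an assumption, and `imbalance_score` is a hypothetical name.

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 = perfectly balanced, near 1 = one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # single class: defined as 0 here (an assumption of this sketch)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score([97, 3])  # class counts from the Common Values tables
print(f"{score:.1%}")
```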
Reproduction
Analysis started | 2023-12-10 09:51:19.336896 |
---|---|
Analysis finished | 2023-12-10 09:51:21.015578 |
Duration | 1.68 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
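The reported duration is the difference between the two timestamps above. A small sketch of that check, using only the values shown in this table:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 09:51:19.336896")
finished = datetime.fromisoformat("2023-12-10 09:51:21.015578")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```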
efc_yy
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2015 | 97 |
---|---|
2021 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2015 |
---|---|
2nd row | 2021 |
3rd row | 2015 |
4th row | 2015 |
5th row | 2015 |
Common Values
Value | Count | Frequency (%) |
2015 | 97 | 97.0% |
2021 | 3 | 3.0% |
qf_grade_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2급 장애인스포츠지도사 | 97 |
---|---|
유소년스포츠지도사 | 3 |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 11.91 |
Min length | 9 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2급 장애인스포츠지도사 |
---|---|
2nd row | 유소년스포츠지도사 |
3rd row | 2급 장애인스포츠지도사 |
4th row | 2급 장애인스포츠지도사 |
5th row | 2급 장애인스포츠지도사 |
Common Values
Value | Count | Frequency (%) |
2급 장애인스포츠지도사 | 97 | 97.0% |
유소년스포츠지도사 | 3 | 3.0% |
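The Length statistics for qf_grade_nm (max 12, median 12, mean 11.91, min 9) can be recovered from the value counts in the Common Values table. A minimal sketch, assuming length is counted in Unicode code points, which is what Python's `len` does:

```python
from statistics import mean, median

# Value counts taken from the Common Values table for qf_grade_nm.
value_counts = {"2급 장애인스포츠지도사": 97, "유소년스포츠지도사": 3}

# Expand to one length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 2),
    "min": min(lengths),
}
print(stats)
```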
Words
Value | Count | Frequency (%) |
2급 | 97 | 49.2% |
장애인스포츠지도사 | 97 | 49.2% |
유소년스포츠지도사 | 3 | 1.5% |
cour_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
일반과정 | 97 |
---|---|
특별과정 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반과정 |
---|---|
2nd row | 특별과정 |
3rd row | 일반과정 |
4th row | 일반과정 |
5th row | 일반과정 |
Common Values
Value | Count | Frequency (%) |
일반과정 | 97 | 97.0% |
특별과정 | 3 | 3.0% |
usr_no
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 1000 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | C000019915 |
---|---|
2nd row | P000176688 |
3rd row | C000024828 |
4th row | C000033959 |
5th row | C000035450 |
Words
Value | Count | Frequency (%) |
c000019915 | 1 | 1.0% |
c000110462 | 1 | 1.0% |
c000119718 | 1 | 1.0% |
c000119526 | 1 | 1.0% |
c000118839 | 1 | 1.0% |
c000118683 | 1 | 1.0% |
c000118655 | 1 | 1.0% |
c000115180 | 1 | 1.0% |
c000114246 | 1 | 1.0% |
c000113585 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 396 | 39.6% |
1 | 102 | 10.2% |
C | 97 | 9.7% |
6 | 61 | 6.1% |
2 | 60 | 6.0% |
9 | 53 | 5.3% |
8 | 50 | 5.0% |
3 | 49 | 4.9% |
5 | 46 | 4.6% |
4 | 43 | 4.3% |
Other values (2) | 43 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 900 | 90.0% |
Uppercase Letter | 100 | 10.0% |
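The category split above (nine decimal digits and one uppercase letter per 10-character ID) can be illustrated with Python's standard `unicodedata` module. This sketch uses only the five sample values shown for usr_no, so the counts are for those five IDs, not for the full column.

```python
import unicodedata
from collections import Counter

# The five sample values shown for usr_no in this profile.
sample_ids = ["C000019915", "P000176688", "C000024828", "C000033959", "C000035450"]

category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)
total = sum(category_counts.values())

# "Nd" is Decimal Number and "Lu" is Uppercase Letter in Unicode's general categories.
for category, count in category_counts.most_common():
    print(category, count, f"{count / total:.1%}")
```

The 90% / 10% split for the sample matches the split reported for all 100 values, as expected when every ID has the same letter-plus-nine-digits shape.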
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 396 | 44.0% |
1 | 102 | 11.3% |
6 | 61 | 6.8% |
2 | 60 | 6.7% |
9 | 53 | 5.9% |
8 | 50 | 5.6% |
3 | 49 | 5.4% |
5 | 46 | 5.1% |
4 | 43 | 4.8% |
7 | 40 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 97 | 97.0% |
P | 3 | 3.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 900 | 90.0% |
Latin | 100 | 10.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 396 | 44.0% |
1 | 102 | 11.3% |
6 | 61 | 6.8% |
2 | 60 | 6.7% |
9 | 53 | 5.9% |
8 | 50 | 5.6% |
3 | 49 | 5.4% |
5 | 46 | 5.1% |
4 | 43 | 4.8% |
7 | 40 | 4.4% |
Latin
Value | Count | Frequency (%) |
C | 97 | 97.0% |
P | 3 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 396 | 39.6% |
1 | 102 | 10.2% |
C | 97 | 9.7% |
6 | 61 | 6.1% |
2 | 60 | 6.0% |
9 | 53 | 5.3% |
8 | 50 | 5.0% |
3 | 49 | 4.9% |
5 | 46 | 4.6% |
4 | 43 | 4.3% |
Other values (2) | 43 | 4.3% |
prtc_tot_grde
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 39 |
---|---|
Distinct (%) | 39.4% |
Missing | 1 |
Missing (%) | 1.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 75.393939 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 7 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 72 |
median | 81 |
Q3 | 88 |
95-th percentile | 100 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 23.707322 |
---|---|
Coefficient of variation (CV) | 0.31444599 |
Kurtosis | 4.9028211 |
Mean | 75.393939 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -2.2202834 |
Sum | 7464 |
Variance | 562.03711 |
Monotonicity | Not monotonic |
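Several of the prtc_tot_grde statistics above are derived from one another, which makes them easy to cross-check: the mean is the sum over the non-missing values, the coefficient of variation is the standard deviation over the mean, the variance is the squared standard deviation, and the IQR is Q3 minus Q1. A short sketch using only figures from this profile:

```python
# Figures copied from the prtc_tot_grde tables in this profile.
total_sum, n_observations, n_missing = 7464, 100, 1
std, q1, q3 = 23.707322, 72, 88

n_valid = n_observations - n_missing  # 99 non-missing values
mean = total_sum / n_valid            # reported: 75.393939
cv = std / mean                       # reported: 0.31444599
variance = std ** 2                   # reported: 562.03711
iqr = q3 - q1                         # reported: 16

print(round(mean, 6), round(cv, 6), round(variance, 2), iqr)
```

Note that the mean divides by 99 rather than 100, because the one missing value is excluded.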
Common Values
Value | Count | Frequency (%) |
0 | 7 | 7.0% |
78 | 7 | 7.0% |
100 | 7 | 7.0% |
82 | 6 | 6.0% |
81 | 5 | 5.0% |
80 | 5 | 5.0% |
88 | 4 | 4.0% |
84 | 4 | 4.0% |
85 | 4 | 4.0% |
83 | 3 | 3.0% |
Other values (29) | 47 | 47.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 7 | 7.0% |
41 | 1 | 1.0% |
55 | 1 | 1.0% |
58 | 1 | 1.0% |
61 | 1 | 1.0% |
62 | 1 | 1.0% |
63 | 2 | 2.0% |
65 | 3 | 3.0% |
66 | 1 | 1.0% |
67 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
100 | 7 | 7.0% |
99 | 1 | 1.0% |
98 | 1 | 1.0% |
97 | 1 | 1.0% |
96 | 2 | 2.0% |
95 | 2 | 2.0% |
94 | 3 | 3.0% |
91 | 2 | 2.0% |
90 | 1 | 1.0% |
89 | 2 | 2.0% |
orst_tot_grde
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 36 |
---|---|
Distinct (%) | 36.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 69.49 |
Minimum | 0 |
---|---|
Maximum | 97 |
Zeros | 5 |
Zeros (%) | 5.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 18.05 |
Q1 | 57.75 |
median | 75 |
Q3 | 85 |
95-th percentile | 92.05 |
Maximum | 97 |
Range | 97 |
Interquartile range (IQR) | 27.25 |
Descriptive statistics
Standard deviation | 23.056582 |
---|---|
Coefficient of variation (CV) | 0.33179712 |
Kurtosis | 2.2258646 |
Mean | 69.49 |
Median Absolute Deviation (MAD) | 10.5 |
Skewness | -1.5739569 |
Sum | 6949 |
Variance | 531.60596 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
72 | 9 | 9.0% |
75 | 8 | 8.0% |
78 | 6 | 6.0% |
82 | 6 | 6.0% |
57 | 5 | 5.0% |
0 | 5 | 5.0% |
83 | 4 | 4.0% |
92 | 4 | 4.0% |
97 | 4 | 4.0% |
87 | 4 | 4.0% |
Other values (26) | 45 | 45.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 5 | 5.0% |
19 | 1 | 1.0% |
29 | 1 | 1.0% |
34 | 3 | 3.0% |
36 | 1 | 1.0% |
39 | 1 | 1.0% |
45 | 2 | 2.0% |
48 | 1 | 1.0% |
50 | 1 | 1.0% |
52 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
97 | 4 | 4.0% |
93 | 1 | 1.0% |
92 | 4 | 4.0% |
90 | 3 | 3.0% |
89 | 2 | 2.0% |
88 | 3 | 3.0% |
87 | 4 | 4.0% |
86 | 1 | 1.0% |
85 | 4 | 4.0% |
84 | 1 | 1.0% |
prtc_pas_div_nm
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
합격 | 60 |
---|---|
불합격 | 40 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.4 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 합격 |
---|---|
2nd row | 합격 |
3rd row | 합격 |
4th row | 합격 |
5th row | 합격 |
Common Values
Value | Count | Frequency (%) |
합격 | 60 | 60.0% |
불합격 | 40 | 40.0% |
fnl_pas_yn
Boolean
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
True | 53 |
---|---|
False | 47 |
Value | Count | Frequency (%) |
True | 53 | 53.0% |
False | 47 | 47.0% |
qf_itm_nm
Categorical
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 24.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
수영 | 13 |
---|---|
역도 | 13 |
배드민턴 | 10 |
보치아 | 8 |
배구 | 8 |
Other values (19) | 48 |
Length
Max length | 18 |
---|---|
Median length | 2 |
Mean length | 2.61 |
Min length | 2 |
Unique
Unique | 9 |
---|---|
Unique (%) | 9.0% |
Sample
1st row | 보치아 |
---|---|
2nd row | 유도 |
3rd row | 볼링 |
4th row | 배구 |
5th row | 육상 |
Common Values
Value | Count | Frequency (%) |
수영 | 13 | 13.0% |
역도 | 13 | 13.0% |
배드민턴 | 10 | 10.0% |
보치아 | 8 | 8.0% |
배구 | 8 | 8.0% |
축구 | 7 | 7.0% |
태권도 | 7 | 7.0% |
럭비 | 5 | 5.0% |
육상 | 5 | 5.0% |
농구 | 5 | 5.0% |
Other values (14) | 19 | 19.0% |
zon_nm
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
| | efc_yy | qf_grade_nm | cour_nm | usr_no | prtc_tot_grde | orst_tot_grde | prtc_pas_div_nm | fnl_pas_yn | qf_itm_nm |
---|---|---|---|---|---|---|---|---|---|
efc_yy | 1.000 | 0.963 | 0.963 | 1.000 | 0.245 | 0.000 | 0.000 | 0.126 | 0.709 |
qf_grade_nm | 0.963 | 1.000 | 0.963 | 1.000 | 0.245 | 0.000 | 0.000 | 0.126 | 0.709 |
cour_nm | 0.963 | 0.963 | 1.000 | 1.000 | 0.245 | 0.000 | 0.000 | 0.126 | 0.709 |
usr_no | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
prtc_tot_grde | 0.245 | 0.245 | 0.245 | 1.000 | 1.000 | 0.785 | 0.632 | 0.539 | 0.590 |
orst_tot_grde | 0.000 | 0.000 | 0.000 | 1.000 | 0.785 | 1.000 | 0.912 | 0.793 | 0.000 |
prtc_pas_div_nm | 0.000 | 0.000 | 0.000 | 1.000 | 0.632 | 0.912 | 1.000 | 0.970 | 0.731 |
fnl_pas_yn | 0.126 | 0.126 | 0.126 | 1.000 | 0.539 | 0.793 | 0.970 | 1.000 | 0.743 |
qf_itm_nm | 0.709 | 0.709 | 0.709 | 1.000 | 0.590 | 0.000 | 0.731 | 0.743 | 1.000 |
| | efc_yy | cour_nm | fnl_pas_yn | qf_itm_nm | prtc_pas_div_nm | qf_grade_nm |
---|---|---|---|---|---|---|
efc_yy | 1.000 | 0.826 | 0.080 | 0.557 | 0.000 | 0.826 |
cour_nm | 0.826 | 1.000 | 0.080 | 0.557 | 0.000 | 0.826 |
fnl_pas_yn | 0.080 | 0.080 | 1.000 | 0.587 | 0.845 | 0.080 |
qf_itm_nm | 0.557 | 0.557 | 0.587 | 1.000 | 0.575 | 0.557 |
prtc_pas_div_nm | 0.000 | 0.000 | 0.845 | 0.575 | 1.000 | 0.000 |
qf_grade_nm | 0.826 | 0.826 | 0.080 | 0.557 | 0.000 | 1.000 |
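The categorical-only matrix above has the shape of a Cramér's V table. The sketch below computes a bias-corrected Cramér's V with Yates' continuity correction for 2x2 tables. Both the choice of this method and the contingency tables are assumptions: the profile only gives the marginal counts (97/3, 60/40, 53/47), not the joint counts. The function name `cramers_v_corrected` is hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows.

    Applies Yates' continuity correction when the table is 2x2.
    Assumes no row or column of the table sums to zero.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    r, k = len(row_totals), len(col_totals)
    yates = r == 2 and k == 2

    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            diff = abs(observed - expected)
            if yates:
                diff -= min(0.5, diff)  # shrink each deviation by at most 0.5
            chi2 += diff ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, k_corr - 1))


# ASSUMED joint counts; only the marginals come from this profile.
perfect = [[97, 0], [0, 3]]     # e.g. efc_yy vs cour_nm, if the two always co-occur
pass_fail = [[53, 7], [0, 40]]  # 합격/불합격 rows vs True/False columns, if no 불합격 row is True

v_perfect = cramers_v_corrected(perfect)
v_pass_fail = cramers_v_corrected(pass_fail)
print(round(v_perfect, 3), round(v_pass_fail, 3))
```

Under these assumptions the sketch gives about 0.826 and 0.845, which coincide with the efc_yy/cour_nm and fnl_pas_yn/prtc_pas_div_nm entries. If the assumptions hold, this would explain why variables that co-occur perfectly score below 1.000 in a 100-row sample: both corrections pull the statistic down when one class has only three rows.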
| | prtc_tot_grde | orst_tot_grde | efc_yy | qf_grade_nm | cour_nm | prtc_pas_div_nm | fnl_pas_yn | qf_itm_nm |
---|---|---|---|---|---|---|---|---|
prtc_tot_grde | 1.000 | 0.307 | 0.254 | 0.254 | 0.254 | 0.663 | 0.564 | 0.270 |
orst_tot_grde | 0.307 | 1.000 | 0.000 | 0.000 | 0.000 | 0.722 | 0.602 | 0.000 |
efc_yy | 0.254 | 0.000 | 1.000 | 0.826 | 0.826 | 0.000 | 0.080 | 0.557 |
qf_grade_nm | 0.254 | 0.000 | 0.826 | 1.000 | 0.826 | 0.000 | 0.080 | 0.557 |
cour_nm | 0.254 | 0.000 | 0.826 | 0.826 | 1.000 | 0.000 | 0.080 | 0.557 |
prtc_pas_div_nm | 0.663 | 0.722 | 0.000 | 0.000 | 0.000 | 1.000 | 0.845 | 0.575 |
fnl_pas_yn | 0.564 | 0.602 | 0.080 | 0.080 | 0.080 | 0.845 | 1.000 | 0.587 |
qf_itm_nm | 0.270 | 0.000 | 0.557 | 0.557 | 0.557 | 0.575 | 0.587 | 1.000 |
| | efc_yy | qf_grade_nm | cour_nm | usr_no | prtc_tot_grde | orst_tot_grde | prtc_pas_div_nm | fnl_pas_yn | qf_itm_nm | zon_nm |
---|---|---|---|---|---|---|---|---|---|---|
0 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000019915 | 82 | 83 | 합격 | Y | 보치아 | <NA> |
1 | 2021 | 유소년스포츠지도사 | 특별과정 | P000176688 | <NA> | 82 | 합격 | N | 유도 | <NA> |
2 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000024828 | 81 | 92 | 합격 | Y | 볼링 | <NA> |
3 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000033959 | 95 | 97 | 합격 | Y | 배구 | <NA> |
4 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000035450 | 80 | 87 | 합격 | Y | 육상 | <NA> |
5 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000038412 | 99 | 89 | 합격 | N | 수영 | <NA> |
6 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000041380 | 80 | 92 | 합격 | Y | 육상 | <NA> |
7 | 2021 | 유소년스포츠지도사 | 특별과정 | P000213697 | 69 | 73 | 불합격 | N | 검도 | <NA> |
8 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000042577 | 77 | 69 | 불합격 | N | 역도 | <NA> |
9 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000043773 | 94 | 88 | 합격 | Y | 수영 | <NA> |
| | efc_yy | qf_grade_nm | cour_nm | usr_no | prtc_tot_grde | orst_tot_grde | prtc_pas_div_nm | fnl_pas_yn | qf_itm_nm | zon_nm |
---|---|---|---|---|---|---|---|---|---|---|
90 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000127018 | 96 | 75 | 합격 | Y | 럭비 | <NA> |
91 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000127382 | 91 | 83 | 합격 | Y | 테니스 | <NA> |
92 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000127441 | 82 | 34 | 불합격 | N | 역도 | <NA> |
93 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000128200 | 78 | 75 | 합격 | Y | 배드민턴 | <NA> |
94 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000128653 | 87 | 78 | 합격 | Y | 축구 | <NA> |
95 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000129091 | 65 | 87 | 불합격 | N | 축구 | <NA> |
96 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000129171 | 84 | 75 | 합격 | Y | 보치아 | <NA> |
97 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000129303 | 91 | 36 | 불합격 | N | 수영 | <NA> |
98 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000130699 | 83 | 97 | 합격 | N | 배구 | <NA> |
99 | 2015 | 2급 장애인스포츠지도사 | 일반과정 | C000131192 | 81 | 93 | 합격 | Y | 배구 | <NA> |