Dataset statistics
Number of variables | 27 |
---|---|
Number of observations | 30 |
Missing cells | 53 |
Missing cells (%) | 6.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.8 KiB |
Average record size in memory | 233.4 B |
Variable types
Numeric | 7 |
---|---|
Categorical | 13 |
Boolean | 5 |
Text | 2 |
Dataset
Description | 조혈모세포이식 환자의 사망원인 혈액질환 레지스트리 데이터셋에서 임상의에 직접기입된 사망원인 분류 데이터를 출력, 공여자 데이터, Chronic GVHD의 발생여부와 발생장기의 정보를 확인할 수 있음. 공여자의 혈액형의 체크박스에서 데이터 추출 하여 혈액형 정보를 숫자(A+ =11, A- = 10, AB+ =31, AB- =30, B+ =21, B- =20, O+ =41, O- =40 )로 표기하였음. |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/stem-cell-transplantation |
TRNSPLANT_YN has constant value "" | Constant |
ANC_ENG_YN has constant value "" | Constant |
CELL_SOURCE is highly imbalanced (78.9%) | Imbalance |
ECOG is highly imbalanced (68.6%) | Imbalance |
AGE_DONOR has 1 (3.3%) missing values | Missing |
HCTCI has 2 (6.7%) missing values | Missing |
CD3_INFU has 1 (3.3%) missing values | Missing |
ANC_ENG_YN has 1 (3.3%) missing values | Missing |
PLT_ENG_YN has 3 (10.0%) missing values | Missing |
AGVHD_YN has 1 (3.3%) missing values | Missing |
CGVHD_YN has 1 (3.3%) missing values | Missing |
CGVHD_SITE has 19 (63.3%) missing values | Missing |
CGVHD_ADD_DRUG has 24 (80.0%) missing values | Missing |
PID has unique values | Unique |
HCTCI has 13 (43.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-10-08 18:57:47.236058 |
---|---|
Analysis finished | 2023-10-08 18:57:48.024516 |
Duration | 0.79 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
PID
Real number (ℝ)
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
SEX_PATIENT
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
1 | |
---|---|
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 19 | |
2 | 11 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 19 | |
2 | 11 |
AGE_PATIENT
Real number (ℝ)
Distinct | 22 |
---|---|
Distinct (%) | 73.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 54.466667 |
Minimum | 25 |
---|---|
Maximum | 90 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 29.35 |
Q1 | 44.5 |
median | 58.5 |
Q3 | 64 |
95-th percentile | 74.55 |
Maximum | 90 |
Range | 65 |
Interquartile range (IQR) | 19.5 |
Descriptive statistics
Standard deviation | 15.90648 |
---|---|
Coefficient of variation (CV) | 0.29204063 |
Kurtosis | -0.39072746 |
Mean | 54.466667 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.097505464 |
Sum | 1634 |
Variance | 253.01609 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
59 | 3 | 10.0% |
61 | 2 | 6.7% |
60 | 2 | 6.7% |
31 | 2 | 6.7% |
52 | 2 | 6.7% |
67 | 2 | 6.7% |
74 | 2 | 6.7% |
65 | 1 | 3.3% |
58 | 1 | 3.3% |
28 | 1 | 3.3% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
25 | 1 | |
28 | 1 | |
31 | 2 | |
34 | 1 | |
36 | 1 | |
39 | 1 | |
44 | 1 | |
46 | 1 | |
47 | 1 | |
51 | 1 |
Value | Count | Frequency (%) |
90 | 1 | 3.3% |
75 | 1 | 3.3% |
74 | 2 | |
72 | 1 | 3.3% |
67 | 2 | |
65 | 1 | 3.3% |
61 | 2 | |
60 | 2 | |
59 | 3 | |
58 | 1 | 3.3% |
TRNSPLANT_YN
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 162.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 30 |
COD
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.4 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1.5 |
median | 4 |
Q3 | 4 |
95-th percentile | 5.55 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 2.5 |
Descriptive statistics
Standard deviation | 1.6315848 |
---|---|
Coefficient of variation (CV) | 0.47987788 |
Kurtosis | -0.37781029 |
Mean | 3.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -0.29397473 |
Sum | 102 |
Variance | 2.662069 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 17 | |
1 | 8 | |
5 | 2 | 6.7% |
3 | 1 | 3.3% |
7 | 1 | 3.3% |
6 | 1 | 3.3% |
Value | Count | Frequency (%) |
1 | 8 | |
3 | 1 | 3.3% |
4 | 17 | |
5 | 2 | 6.7% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
Value | Count | Frequency (%) |
7 | 1 | 3.3% |
6 | 1 | 3.3% |
5 | 2 | 6.7% |
4 | 17 | |
3 | 1 | 3.3% |
1 | 8 |
ABO_PATIENT
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
41 | |
---|---|
21 | |
11 | |
31 | |
Q |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.9666667 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 31 |
---|---|
2nd row | 41 |
3rd row | Q |
4th row | 21 |
5th row | 41 |
Common Values
Value | Count | Frequency (%) |
41 | 9 | |
21 | 8 | |
11 | 8 | |
31 | 4 | |
Q | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
41 | 9 | |
21 | 8 | |
11 | 8 | |
31 | 4 | |
q | 1 | 3.3% |
DONOR_TYPE
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
URD | |
---|---|
HAPLO | |
SIB | |
AUTO | 1 |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.6333333 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | URD |
---|---|
2nd row | AUTO |
3rd row | URD |
4th row | URD |
5th row | URD |
Common Values
Value | Count | Frequency (%) |
URD | 14 | |
HAPLO | 9 | |
SIB | 6 | |
AUTO | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
urd | 14 | |
haplo | 9 | |
sib | 6 | |
auto | 1 | 3.3% |
HLA_MATCH
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Match | |
---|---|
HAPLO | |
Mismatch | |
AUTO | 1 |
Length
Max length | 8 |
---|---|
Median length | 5 |
Mean length | 5.5666667 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | Match |
---|---|
2nd row | AUTO |
3rd row | Match |
4th row | Mismatch |
5th row | Mismatch |
Common Values
Value | Count | Frequency (%) |
Match | 14 | |
HAPLO | 9 | |
Mismatch | 6 | |
AUTO | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
match | 14 | |
haplo | 9 | |
mismatch | 6 | |
auto | 1 | 3.3% |
CELL_SOURCE
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
PBSC | |
---|---|
BM_PBSC | 1 |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 4.1 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | PBSC |
---|---|
2nd row | BM_PBSC |
3rd row | PBSC |
4th row | PBSC |
5th row | PBSC |
Common Values
Value | Count | Frequency (%) |
PBSC | 29 | |
BM_PBSC | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
pbsc | 29 | |
bm_pbsc | 1 | 3.3% |
SEX_DONOR
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
1 | |
---|---|
2 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 1 |
---|---|
2nd row | <NA> |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 20 | |
2 | 9 | |
<NA> | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 20 | |
2 | 9 | |
na | 1 | 3.3% |
AGE_DONOR
Real number (ℝ)
MISSING
 
Distinct | 23 |
---|---|
Distinct (%) | 79.3% |
Missing | 1 |
Missing (%) | 3.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.827586 |
Minimum | 9 |
---|---|
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 9 |
---|---|
5-th percentile | 17.6 |
Q1 | 28 |
median | 36 |
Q3 | 45 |
95-th percentile | 56.8 |
Maximum | 63 |
Range | 54 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 12.88429 |
---|---|
Coefficient of variation (CV) | 0.34985431 |
Kurtosis | -0.26965707 |
Mean | 36.827586 |
Median Absolute Deviation (MAD) | 9 |
Skewness | -0.010449526 |
Sum | 1068 |
Variance | 166.00493 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
36 | 3 | 10.0% |
25 | 2 | 6.7% |
33 | 2 | 6.7% |
35 | 2 | 6.7% |
45 | 2 | 6.7% |
26 | 1 | 3.3% |
16 | 1 | 3.3% |
28 | 1 | 3.3% |
49 | 1 | 3.3% |
39 | 1 | 3.3% |
Other values (13) | 13 |
Value | Count | Frequency (%) |
9 | 1 | |
16 | 1 | |
20 | 1 | |
21 | 1 | |
25 | 2 | |
26 | 1 | |
28 | 1 | |
31 | 1 | |
33 | 2 | |
34 | 1 |
Value | Count | Frequency (%) |
63 | 1 | |
58 | 1 | |
55 | 1 | |
54 | 1 | |
49 | 1 | |
48 | 1 | |
47 | 1 | |
45 | 2 | |
44 | 1 | |
42 | 1 |
ABO_DONOR
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
11 | |
---|---|
21 | |
41 | |
31 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0666667 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 11 |
---|---|
2nd row | <NA> |
3rd row | 11 |
4th row | 31 |
5th row | 21 |
Common Values
Value | Count | Frequency (%) |
11 | 11 | |
21 | 7 | |
41 | 6 | |
31 | 5 | |
<NA> | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
11 | 11 | |
21 | 7 | |
41 | 6 | |
31 | 5 | |
na | 1 | 3.3% |
ABO_MATCHING
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Match | |
---|---|
<NA> | |
Major mismatch | |
Bidirectional mismatch | |
Minor mismatch | 1 |
Length
Max length | 22 |
---|---|
Median length | 18 |
Mean length | 7.0333333 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | Major mismatch |
Common Values
Value | Count | Frequency (%) |
Match | 15 | |
<NA> | 9 | |
Major mismatch | 3 | 10.0% |
Bidirectional mismatch | 2 | 6.7% |
Minor mismatch | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
match | 15 | |
na | 9 | |
mismatch | 6 | 16.7% |
major | 3 | 8.3% |
bidirectional | 2 | 5.6% |
minor | 1 | 2.8% |
HCTCI
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 21.4% |
Missing | 2 |
Missing (%) | 6.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6785714 |
Minimum | 0 |
---|---|
Maximum | 6 |
Zeros | 13 |
Zeros (%) | 43.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1.5 |
Q3 | 3 |
95-th percentile | 4 |
Maximum | 6 |
Range | 6 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.8268283 |
---|---|
Coefficient of variation (CV) | 1.0883232 |
Kurtosis | -0.83216583 |
Mean | 1.6785714 |
Median Absolute Deviation (MAD) | 1.5 |
Skewness | 0.59416062 |
Sum | 47 |
Variance | 3.3373016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 13 | |
4 | 5 | 16.7% |
2 | 4 | 13.3% |
3 | 4 | 13.3% |
6 | 1 | 3.3% |
1 | 1 | 3.3% |
(Missing) | 2 | 6.7% |
Value | Count | Frequency (%) |
0 | 13 | |
1 | 1 | 3.3% |
2 | 4 | 13.3% |
3 | 4 | 13.3% |
4 | 5 | 16.7% |
6 | 1 | 3.3% |
Value | Count | Frequency (%) |
6 | 1 | 3.3% |
4 | 5 | 16.7% |
3 | 4 | 13.3% |
2 | 4 | 13.3% |
1 | 1 | 3.3% |
0 | 13 |
ECOG
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
1 | |
---|---|
<NA> | 1 |
11 | 1 |
0 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1333333 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 10.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 27 | |
<NA> | 1 | 3.3% |
11 | 1 | 3.3% |
0 | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 27 | |
na | 1 | 3.3% |
11 | 1 | 3.3% |
0 | 1 | 3.3% |
KPS
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
90 | |
---|---|
80 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0666667 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
90 | 18 | |
80 | 11 | |
<NA> | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
90 | 18 | |
80 | 11 | |
na | 1 | 3.3% |
CD34_INFU
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 33.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.2666667 |
Minimum | 1 |
---|---|
Maximum | 21 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 5 |
median | 6 |
Q3 | 6.75 |
95-th percentile | 12.2 |
Maximum | 21 |
Range | 20 |
Interquartile range (IQR) | 1.75 |
Descriptive statistics
Standard deviation | 3.6096932 |
---|---|
Coefficient of variation (CV) | 0.57601487 |
Kurtosis | 9.6756511 |
Mean | 6.2666667 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.6952094 |
Sum | 188 |
Variance | 13.029885 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 9 | |
5 | 7 | |
7 | 4 | |
3 | 3 | 10.0% |
4 | 2 | 6.7% |
1 | 1 | 3.3% |
8 | 1 | 3.3% |
14 | 1 | 3.3% |
10 | 1 | 3.3% |
21 | 1 | 3.3% |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
3 | 3 | 10.0% |
4 | 2 | 6.7% |
5 | 7 | |
6 | 9 | |
7 | 4 | |
8 | 1 | 3.3% |
10 | 1 | 3.3% |
14 | 1 | 3.3% |
21 | 1 | 3.3% |
Value | Count | Frequency (%) |
21 | 1 | 3.3% |
14 | 1 | 3.3% |
10 | 1 | 3.3% |
8 | 1 | 3.3% |
7 | 4 | |
6 | 9 | |
5 | 7 | |
4 | 2 | 6.7% |
3 | 3 | 10.0% |
1 | 1 | 3.3% |
CD3_INFU
Real number (ℝ)
MISSING
 
Distinct | 29 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 3.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 334.65517 |
Minimum | 121 |
---|---|
Maximum | 582 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 121 |
---|---|
5-th percentile | 197 |
Q1 | 259 |
median | 315 |
Q3 | 405 |
95-th percentile | 557.2 |
Maximum | 582 |
Range | 461 |
Interquartile range (IQR) | 146 |
Descriptive statistics
Standard deviation | 111.86065 |
---|---|
Coefficient of variation (CV) | 0.33425646 |
Kurtosis | 0.20380756 |
Mean | 334.65517 |
Median Absolute Deviation (MAD) | 60 |
Skewness | 0.65238365 |
Sum | 9705 |
Variance | 12512.805 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
223 | 1 | 3.3% |
306 | 1 | 3.3% |
189 | 1 | 3.3% |
268 | 1 | 3.3% |
433 | 1 | 3.3% |
435 | 1 | 3.3% |
347 | 1 | 3.3% |
316 | 1 | 3.3% |
405 | 1 | 3.3% |
538 | 1 | 3.3% |
Other values (19) | 19 |
Value | Count | Frequency (%) |
121 | 1 | |
189 | 1 | |
209 | 1 | |
223 | 1 | |
239 | 1 | |
241 | 1 | |
255 | 1 | |
259 | 1 | |
268 | 1 | |
291 | 1 |
Value | Count | Frequency (%) |
582 | 1 | |
570 | 1 | |
538 | 1 | |
491 | 1 | |
435 | 1 | |
433 | 1 | |
411 | 1 | |
405 | 1 | |
366 | 1 | |
347 | 1 |
ANC_ENG_YN
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 3.4% |
Missing | 1 |
Missing (%) | 3.3% |
Memory size | 192.0 B |
True | |
---|---|
(Missing) | 1 |
Value | Count | Frequency (%) |
True | 29 | |
(Missing) | 1 | 3.3% |
PLT_ENG_YN
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 7.4% |
Missing | 3 |
Missing (%) | 10.0% |
Memory size | 192.0 B |
True | |
---|---|
False | |
(Missing) |
Value | Count | Frequency (%) |
True | 24 | |
False | 3 | 10.0% |
(Missing) | 3 | 10.0% |
RELAPSE_YN
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
N | |
---|---|
Persistent | |
Y |
Length
Max length | 10 |
---|---|
Median length | 1 |
Mean length | 4 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Y |
---|---|
2nd row | Y |
3rd row | N |
4th row | Persistent |
5th row | Persistent |
Common Values
Value | Count | Frequency (%) |
N | 15 | |
Persistent | 10 | |
Y | 5 | 16.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
n | 15 | |
persistent | 10 | |
y | 5 | 16.7% |
AGVHD_YN
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 6.9% |
Missing | 1 |
Missing (%) | 3.3% |
Memory size | 192.0 B |
False | |
---|---|
True | |
(Missing) | 1 |
Value | Count | Frequency (%) |
False | 17 | |
True | 12 | |
(Missing) | 1 | 3.3% |
AGVHD_MAX_GR
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
0 | |
---|---|
2 | |
1 | |
3 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 0 |
---|---|
2nd row | <NA> |
3rd row | 0 |
4th row | 0 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
0 | 17 | |
2 | 6 | 20.0% |
1 | 4 | 13.3% |
3 | 2 | 6.7% |
<NA> | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 17 | |
2 | 6 | 20.0% |
1 | 4 | 13.3% |
3 | 2 | 6.7% |
na | 1 | 3.3% |
AGVHD_ADD_DRUG
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
<NA> | |
---|---|
jakavi,mmf | |
jakavi | 2 |
mmf | 1 |
Length
Max length | 10 |
---|---|
Median length | 4 |
Mean length | 4.9 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | jakavi,mmf |
Common Values
Value | Count | Frequency (%) |
<NA> | 23 | |
jakavi,mmf | 4 | 13.3% |
jakavi | 2 | 6.7% |
mmf | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 23 | |
jakavi,mmf | 4 | 13.3% |
jakavi | 2 | 6.7% |
mmf | 1 | 3.3% |
CGVHD_YN
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 6.9% |
Missing | 1 |
Missing (%) | 3.3% |
Memory size | 192.0 B |
False | |
---|---|
True | |
(Missing) | 1 |
Value | Count | Frequency (%) |
False | 18 | |
True | 11 | |
(Missing) | 1 | 3.3% |
CGVHD_SITE
Text
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 81.8% |
Missing | 19 |
Missing (%) | 63.3% |
Memory size | 372.0 B |
Length
Max length | 26 |
---|---|
Median length | 22 |
Mean length | 13.181818 |
Min length | 4 |
Characters and Unicode
Total characters | 145 |
---|---|
Distinct characters | 20 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 72.7% |
Sample
1st row | mouth |
---|---|
2nd row | skin |
3rd row | skin,nail,hematopietic |
4th row | skin,nail,eyes,liver |
5th row | skin |
Value | Count | Frequency (%) |
skin | 3 | |
mouth | 1 | 9.1% |
skin,nail,hematopietic | 1 | 9.1% |
skin,nail,eyes,liver | 1 | 9.1% |
eyes | 1 | 9.1% |
skin,nail,eyes,gi,liver | 1 | 9.1% |
skin,nail,gi,liver | 1 | 9.1% |
skin,nail,mouth,eyes,liver | 1 | 9.1% |
skin,nail,liver | 1 | 9.1% |
Most occurring characters
Value | Count | Frequency (%) |
i | 22 | |
, | 18 | |
n | 15 | |
e | 15 | |
s | 13 | |
l | 11 | |
k | 9 | 6.2% |
a | 7 | 4.8% |
v | 5 | 3.4% |
r | 5 | 3.4% |
Other values (10) | 25 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 123 | |
Other Punctuation | 18 | 12.4% |
Uppercase Letter | 4 | 2.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 22 | |
n | 15 | |
e | 15 | |
s | 13 | |
l | 11 | |
k | 9 | |
a | 7 | 5.7% |
v | 5 | 4.1% |
r | 5 | 4.1% |
t | 4 | 3.3% |
Other values (7) | 17 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 2 | |
I | 2 |
Other Punctuation
Value | Count | Frequency (%) |
, | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 127 | |
Common | 18 | 12.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 22 | |
n | 15 | |
e | 15 | |
s | 13 | |
l | 11 | |
k | 9 | |
a | 7 | 5.5% |
v | 5 | 3.9% |
r | 5 | 3.9% |
t | 4 | 3.1% |
Other values (9) | 21 |
Common
Value | Count | Frequency (%) |
, | 18 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 145 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 22 | |
, | 18 | |
n | 15 | |
e | 15 | |
s | 13 | |
l | 11 | |
k | 9 | 6.2% |
a | 7 | 4.8% |
v | 5 | 3.4% |
r | 5 | 3.4% |
Other values (10) | 25 |
CGVHD_ADD_DRUG
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | 50.0% |
Missing | 24 |
Missing (%) | 80.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
jakavi,mmf | 3 | |
jakavi | 2 | |
mmf | 1 | 16.7% |
Most occurring characters
Value | Count | Frequency (%) |
a | 10 | |
m | 8 | |
j | 5 | |
k | 5 | |
v | 5 | |
i | 5 | |
f | 4 | 8.9% |
, | 3 | 6.7% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 42 | |
Other Punctuation | 3 | 6.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 10 | |
m | 8 | |
j | 5 | |
k | 5 | |
v | 5 | |
i | 5 | |
f | 4 | 9.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 42 | |
Common | 3 | 6.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 10 | |
m | 8 | |
j | 5 | |
k | 5 | |
v | 5 | |
i | 5 | |
f | 4 | 9.5% |
Common
Value | Count | Frequency (%) |
, | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 45 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 10 | |
m | 8 | |
j | 5 | |
k | 5 | |
v | 5 | |
i | 5 | |
f | 4 | 8.9% |
, | 3 | 6.7% |
PID | SEX_PATIENT | AGE_PATIENT | TRNSPLANT_YN | COD | ABO_PATIENT | DONOR_TYPE | HLA_MATCH | CELL_SOURCE | SEX_DONOR | AGE_DONOR | ABO_DONOR | ABO_MATCHING | HCTCI | ECOG | KPS | CD34_INFU | CD3_INFU | ANC_ENG_YN | PLT_ENG_YN | RELAPSE_YN | AGVHD_YN | AGVHD_MAX_GR | AGVHD_ADD_DRUG | CGVHD_YN | CGVHD_SITE | CGVHD_ADD_DRUG | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 1 | 65 | Y | 4 | 31 | URD | Match | PBSC | 1 | 26 | 11 | <NA> | 0 | 1 | 90 | 6 | 223 | Y | Y | Y | N | 0 | <NA> | N | <NA> | <NA> |
1 | 2 | 2 | 67 | Y | 4 | 41 | AUTO | AUTO | BM_PBSC | <NA> | <NA> | <NA> | <NA> | 0 | 1 | 90 | 1 | <NA> | Y | Y | Y | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | 3 | 2 | 44 | Y | 5 | Q | URD | Match | PBSC | 2 | 20 | 11 | <NA> | 2 | 1 | 90 | 6 | 306 | Y | N | N | N | 0 | <NA> | N | <NA> | <NA> |
3 | 4 | 1 | 60 | Y | 1 | 21 | URD | Mismatch | PBSC | 1 | 25 | 31 | <NA> | 4 | 1 | 90 | 3 | 255 | <NA> | N | Persistent | N | 0 | <NA> | N | <NA> | <NA> |
4 | 5 | 1 | 59 | Y | 1 | 41 | URD | Mismatch | PBSC | 1 | 34 | 21 | Major mismatch | 0 | 1 | 90 | 7 | 293 | Y | Y | Persistent | Y | 2 | jakavi,mmf | N | <NA> | <NA> |
5 | 6 | 1 | 31 | Y | 1 | 21 | HAPLO | HAPLO | PBSC | 1 | 58 | 11 | Bidirectional mismatch | 0 | 1 | 80 | 5 | 241 | Y | Y | Persistent | N | 0 | <NA> | Y | mouth | <NA> |
6 | 7 | 1 | 60 | Y | 4 | 31 | URD | Match | PBSC | 1 | 33 | 31 | Match | 0 | 1 | 90 | 8 | 315 | Y | Y | N | N | 0 | <NA> | N | <NA> | <NA> |
7 | 8 | 1 | 46 | Y | 4 | 21 | HAPLO | HAPLO | PBSC | 1 | 9 | 21 | Match | 0 | 1 | 90 | 3 | 259 | Y | <NA> | Persistent | N | 0 | <NA> | N | <NA> | <NA> |
8 | 9 | 1 | 25 | Y | 1 | 11 | URD | Match | PBSC | 2 | 35 | 11 | Match | 4 | 1 | 90 | 14 | 582 | Y | <NA> | Persistent | Y | 1 | <NA> | Y | skin | <NA> |
9 | 10 | 2 | 31 | Y | 4 | 31 | HAPLO | HAPLO | PBSC | 1 | 63 | 21 | <NA> | 3 | 1 | 90 | 6 | 366 | Y | Y | N | Y | 2 | <NA> | N | <NA> | <NA> |
PID | SEX_PATIENT | AGE_PATIENT | TRNSPLANT_YN | COD | ABO_PATIENT | DONOR_TYPE | HLA_MATCH | CELL_SOURCE | SEX_DONOR | AGE_DONOR | ABO_DONOR | ABO_MATCHING | HCTCI | ECOG | KPS | CD34_INFU | CD3_INFU | ANC_ENG_YN | PLT_ENG_YN | RELAPSE_YN | AGVHD_YN | AGVHD_MAX_GR | AGVHD_ADD_DRUG | CGVHD_YN | CGVHD_SITE | CGVHD_ADD_DRUG | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
20 | 21 | 1 | 34 | Y | 5 | 41 | URD | Mismatch | PBSC | 2 | 35 | 41 | Match | 0 | 1 | 80 | 3 | 411 | Y | Y | Persistent | Y | 2 | jakavi | N | <NA> | <NA> |
21 | 22 | 2 | 39 | Y | 1 | 41 | SIB | Match | PBSC | 1 | 31 | 41 | Match | 0 | 1 | 80 | 10 | 570 | Y | Y | Persistent | Y | 2 | <NA> | N | <NA> | <NA> |
22 | 23 | 2 | 36 | Y | 4 | 11 | URD | Mismatch | PBSC | 2 | 45 | 11 | Match | 0 | 1 | 90 | 5 | 538 | Y | Y | N | Y | 3 | <NA> | Y | skin,nail,eyes,GI,liver | jakavi,mmf |
23 | 24 | 2 | 52 | Y | 4 | 41 | SIB | Match | PBSC | 1 | 39 | 21 | Major mismatch | 2 | 1 | 80 | 21 | 405 | Y | Y | N | N | 0 | <NA> | Y | skin,nail,GI,liver | jakavi |
24 | 25 | 1 | 61 | Y | 4 | 21 | URD | Match | PBSC | 1 | 36 | 31 | <NA> | 4 | 1 | 90 | 4 | 316 | Y | Y | N | Y | 3 | mmf | N | <NA> | <NA> |
25 | 26 | 1 | 28 | Y | 1 | 41 | URD | Match | PBSC | 1 | 36 | 41 | Match | 2 | 1 | 90 | 6 | 347 | Y | Y | Y | Y | 1 | jakavi,mmf | Y | skin | mmf |
26 | 27 | 1 | 52 | Y | 4 | 11 | URD | Mismatch | PBSC | 1 | 45 | 21 | Bidirectional mismatch | 4 | 1 | 80 | 5 | 435 | Y | N | Persistent | Y | 2 | <NA> | N | <NA> | <NA> |
27 | 28 | 2 | 58 | Y | 6 | 11 | HAPLO | HAPLO | PBSC | 1 | 33 | 11 | Match | 2 | 11 | 80 | 5 | 433 | Y | Y | N | N | 0 | <NA> | Y | skin,nail,mouth,eyes,liver | <NA> |
28 | 29 | 1 | 59 | Y | 4 | 41 | SIB | Mismatch | PBSC | 1 | 49 | 41 | Match | 3 | 0 | 90 | 7 | 268 | Y | Y | Persistent | N | 0 | <NA> | Y | skin,nail,liver | jakavi,mmf |
29 | 30 | 1 | 74 | Y | 1 | 11 | URD | Match | PBSC | 1 | 28 | 11 | Match | 0 | 1 | 90 | 6 | 189 | Y | Y | Y | N | 0 | <NA> | N | <NA> | <NA> |