Dataset statistics
Number of variables | 33 |
---|---|
Number of observations | 100 |
Missing cells | 1545 |
Missing cells (%) | 46.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.5 KiB |
Average record size in memory | 281.3 B |
Variable types
Numeric | 6 |
---|---|
DateTime | 16 |
Categorical | 11 |
Dataset
Description | 당뇨 환자의 처방 약물 코드와 최초 처방일과 최종 처방일. sulfonylurea (RxNorm 코드: 1597772, 1597758, 1597773, 19101729, 21133671, 19059797), sulfonylurea+metformin(42953698, 42953917, 42953740), meglitinide(19023425, 19023424, 19023426, 42962884, 19107111, 19107110, 1502829), metformin(19106521, 40164929, 40164946, 40164897, 40164894, 40164925), TZD(1525221, 19079293, 42960773), DPP4i(19125041, 40239218, 43013911, 43013924, 42960599, 42961500), DPP4i-MET(40164922, 42708088,42708090, 42708086), Insulin(46234044, 35782236, 35779361, 41348914, 35786039, 36809748, 42920572, 46234044, 41370419, 41349142, 46234044, 35782557, 35159339, 35781503, 35781503, 46234044, 46234044, 586875, 35781503, 35781503, 46234044, 41348508, 40717097 , 35779506, 40755064, 42921713) |
---|---|
Author | 가톨릭대학교 은평성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_pre-eunpyeong |
Meg_f_date has constant value "" | Constant |
Meg_l_date has constant value "" | Constant |
SU-MET_f_prcd is highly imbalanced (67.1%) | Imbalance |
SU-MET_l_prcd is highly imbalanced (65.9%) | Imbalance |
Meg_f_prcd is highly imbalanced (91.9%) | Imbalance |
Meg_l_prcd is highly imbalanced (91.9%) | Imbalance |
TZD_f_prcd is highly imbalanced (87.9%) | Imbalance |
TZD_l_prcd is highly imbalanced (89.8%) | Imbalance |
DPP4i-MET_f_prcd is highly imbalanced (56.5%) | Imbalance |
DPP4i-MET_l_prcd is highly imbalanced (70.8%) | Imbalance |
SU_f_date has 62 (62.0%) missing values | Missing |
SU_f_prcd has 62 (62.0%) missing values | Missing |
SU_l_date has 65 (65.0%) missing values | Missing |
SU-MET_f_date has 89 (89.0%) missing values | Missing |
SU-MET_l_date has 90 (90.0%) missing values | Missing |
Meg_f_date has 99 (99.0%) missing values | Missing |
Meg_l_date has 99 (99.0%) missing values | Missing |
Met_f_date has 55 (55.0%) missing values | Missing |
Met_f_prcd has 55 (55.0%) missing values | Missing |
Met_l_date has 58 (58.0%) missing values | Missing |
Met_l_prcd has 58 (58.0%) missing values | Missing |
TZD_f_date has 97 (97.0%) missing values | Missing |
TZD_l_date has 98 (98.0%) missing values | Missing |
DPP4i_f_date has 65 (65.0%) missing values | Missing |
DPP4i_l_date has 67 (67.0%) missing values | Missing |
DPP4i-MET_f_date has 86 (86.0%) missing values | Missing |
DPP4i-MET_l_date has 90 (90.0%) missing values | Missing |
Insul_f_date has 61 (61.0%) missing values | Missing |
Insul_f_prcd has 61 (61.0%) missing values | Missing |
Insul_l_date has 64 (64.0%) missing values | Missing |
Insul_l_prcd has 64 (64.0%) missing values | Missing |
RID has unique values | Unique |
Reproduction
Analysis started | 2023-10-08 18:56:32.444150 |
---|---|
Analysis finished | 2023-10-08 18:56:33.046086 |
Duration | 0.6 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
SU_f_date
Date
MISSING
 
Distinct | 36 |
---|---|
Distinct (%) | 94.7% |
Missing | 62 |
Missing (%) | 62.0% |
Memory size | 932.0 B |
Minimum | 2015-09-04 00:00:00 |
---|---|
Maximum | 2020-04-10 00:00:00 |
SU_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 15.8% |
Missing | 62 |
Missing (%) | 62.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15757033 |
Minimum | 1597758 |
---|---|
Maximum | 21133671 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1597758 |
---|---|
5-th percentile | 1597769.9 |
Q1 | 19070280 |
median | 19101729 |
Q3 | 21133671 |
95-th percentile | 21133671 |
Maximum | 21133671 |
Range | 19535913 |
Interquartile range (IQR) | 2063391 |
Descriptive statistics
Standard deviation | 8044345.8 |
---|---|
Coefficient of variation (CV) | 0.51052414 |
Kurtosis | -0.4049996 |
Mean | 15757033 |
Median Absolute Deviation (MAD) | 2031942 |
Skewness | -1.2437383 |
Sum | 5.9876726 × 108 |
Variance | 6.4711499 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21133671 | 15 | 15.0% |
19101729 | 13 | 13.0% |
1597772 | 5 | 5.0% |
1597773 | 2 | 2.0% |
1597758 | 2 | 2.0% |
19059797 | 1 | 1.0% |
(Missing) | 62 |
Value | Count | Frequency (%) |
1597758 | 2 | 2.0% |
1597772 | 5 | 5.0% |
1597773 | 2 | 2.0% |
19059797 | 1 | 1.0% |
19101729 | 13 | |
21133671 | 15 |
Value | Count | Frequency (%) |
21133671 | 15 | |
19101729 | 13 | |
19059797 | 1 | 1.0% |
1597773 | 2 | 2.0% |
1597772 | 5 | 5.0% |
1597758 | 2 | 2.0% |
SU_l_date
Date
MISSING
 
Distinct | 34 |
---|---|
Distinct (%) | 97.1% |
Missing | 65 |
Missing (%) | 65.0% |
Memory size | 932.0 B |
Minimum | 2015-12-21 00:00:00 |
---|---|
Maximum | 2020-04-27 00:00:00 |
SU_l_prcd
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
21133671 | |
19101729 | |
19059797 | 3 |
DGMPD4 | 2 |
Other values (2) | 3 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 5.3 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 21133671 |
3rd row | 19101729 |
4th row | <NA> |
5th row | 21133671 |
Common Values
Value | Count | Frequency (%) |
<NA> | 65 | |
21133671 | 16 | 16.0% |
19101729 | 11 | 11.0% |
19059797 | 3 | 3.0% |
DGMPD4 | 2 | 2.0% |
DGMPD2 | 2 | 2.0% |
DGMPD3 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 65 | |
21133671 | 16 | 16.0% |
19101729 | 11 | 11.0% |
19059797 | 3 | 3.0% |
dgmpd4 | 2 | 2.0% |
dgmpd2 | 2 | 2.0% |
dgmpd3 | 1 | 1.0% |
SU-MET_f_date
Date
MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 100.0% |
Missing | 89 |
Missing (%) | 89.0% |
Memory size | 932.0 B |
Minimum | 2015-09-16 00:00:00 |
---|---|
Maximum | 2019-12-12 00:00:00 |
SU-MET_f_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42953740 | 6 |
42953917 | 3 |
42953698 | 2 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.44 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 89 | |
42953740 | 6 | 6.0% |
42953917 | 3 | 3.0% |
42953698 | 2 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 89 | |
42953740 | 6 | 6.0% |
42953917 | 3 | 3.0% |
42953698 | 2 | 2.0% |
SU-MET_l_date
Date
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 90 |
Missing (%) | 90.0% |
Memory size | 932.0 B |
Minimum | 2016-04-07 00:00:00 |
---|---|
Maximum | 2020-04-13 00:00:00 |
SU-MET_l_prcd
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42953740 | 8 |
42953917 | 2 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 90 | |
42953740 | 8 | 8.0% |
42953917 | 2 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 90 | |
42953740 | 8 | 8.0% |
42953917 | 2 | 2.0% |
Meg_f_date
Date
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 99 |
Missing (%) | 99.0% |
Memory size | 932.0 B |
Minimum | 2015-09-22 00:00:00 |
---|---|
Maximum | 2015-09-22 00:00:00 |
Meg_f_prcd
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
19023426 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.04 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 99 | |
19023426 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 99 | |
19023426 | 1 | 1.0% |
Meg_l_date
Date
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 99 |
Missing (%) | 99.0% |
Memory size | 932.0 B |
Minimum | 2016-06-13 00:00:00 |
---|---|
Maximum | 2016-06-13 00:00:00 |
Meg_l_prcd
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
19023426 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.04 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 99 | |
19023426 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 99 | |
19023426 | 1 | 1.0% |
Met_f_date
Date
MISSING
 
Distinct | 43 |
---|---|
Distinct (%) | 95.6% |
Missing | 55 |
Missing (%) | 55.0% |
Memory size | 932.0 B |
Minimum | 2015-09-07 00:00:00 |
---|---|
Maximum | 2020-04-10 00:00:00 |
Met_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 13.3% |
Missing | 55 |
Missing (%) | 55.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37825104 |
Minimum | 19106521 |
---|---|
Maximum | 40164946 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19106521 |
---|---|
5-th percentile | 19106521 |
Q1 | 40164925 |
median | 40164929 |
Q3 | 40164929 |
95-th percentile | 40164946 |
Maximum | 40164946 |
Range | 21058425 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 6692800.8 |
---|---|
Coefficient of variation (CV) | 0.17694071 |
Kurtosis | 4.769103 |
Mean | 37825104 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -2.5610449 |
Sum | 1.7021297 × 109 |
Variance | 4.4793582 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40164929 | 15 | 15.0% |
40164925 | 10 | 10.0% |
40164946 | 9 | 9.0% |
19106521 | 5 | 5.0% |
40164897 | 5 | 5.0% |
40164894 | 1 | 1.0% |
(Missing) | 55 |
Value | Count | Frequency (%) |
19106521 | 5 | 5.0% |
40164894 | 1 | 1.0% |
40164897 | 5 | 5.0% |
40164925 | 10 | |
40164929 | 15 | |
40164946 | 9 |
Value | Count | Frequency (%) |
40164946 | 9 | |
40164929 | 15 | |
40164925 | 10 | |
40164897 | 5 | 5.0% |
40164894 | 1 | 1.0% |
19106521 | 5 | 5.0% |
Met_l_date
Date
MISSING
 
Distinct | 42 |
---|---|
Distinct (%) | 100.0% |
Missing | 58 |
Missing (%) | 58.0% |
Memory size | 932.0 B |
Minimum | 2015-11-09 00:00:00 |
---|---|
Maximum | 2020-04-27 00:00:00 |
Met_l_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 14.3% |
Missing | 58 |
Missing (%) | 58.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38159362 |
Minimum | 19106521 |
---|---|
Maximum | 40164946 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19106521 |
---|---|
5-th percentile | 19106521 |
Q1 | 40164904 |
median | 40164929 |
Q3 | 40164929 |
95-th percentile | 40164946 |
Maximum | 40164946 |
Range | 21058425 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 6256488.7 |
---|---|
Coefficient of variation (CV) | 0.16395685 |
Kurtosis | 6.4923583 |
Mean | 38159362 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -2.8609726 |
Sum | 1.6026932 × 109 |
Variance | 3.9143651 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40164929 | 20 | 20.0% |
40164925 | 6 | 6.0% |
40164946 | 5 | 5.0% |
19106521 | 4 | 4.0% |
40164897 | 4 | 4.0% |
40164894 | 3 | 3.0% |
(Missing) | 58 |
Value | Count | Frequency (%) |
19106521 | 4 | 4.0% |
40164894 | 3 | 3.0% |
40164897 | 4 | 4.0% |
40164925 | 6 | 6.0% |
40164929 | 20 | |
40164946 | 5 | 5.0% |
Value | Count | Frequency (%) |
40164946 | 5 | 5.0% |
40164929 | 20 | |
40164925 | 6 | 6.0% |
40164897 | 4 | 4.0% |
40164894 | 3 | 3.0% |
19106521 | 4 | 4.0% |
TZD_f_date
Date
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | 100.0% |
Missing | 97 |
Missing (%) | 97.0% |
Memory size | 932.0 B |
Minimum | 2015-09-25 00:00:00 |
---|---|
Maximum | 2017-10-26 00:00:00 |
TZD_f_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
19079293 | 1 |
1525221 | 1 |
42960773 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.11 |
Min length | 4 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | 19079293 |
Common Values
Value | Count | Frequency (%) |
<NA> | 97 | |
19079293 | 1 | 1.0% |
1525221 | 1 | 1.0% |
42960773 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 97 | |
19079293 | 1 | 1.0% |
1525221 | 1 | 1.0% |
42960773 | 1 | 1.0% |
TZD_l_date
Date
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 100.0% |
Missing | 98 |
Missing (%) | 98.0% |
Memory size | 932.0 B |
Minimum | 2016-03-29 00:00:00 |
---|---|
Maximum | 2018-02-20 00:00:00 |
TZD_l_prcd
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
19079293 | 1 |
1525221 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.07 |
Min length | 4 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | 19079293 |
Common Values
Value | Count | Frequency (%) |
<NA> | 98 | |
19079293 | 1 | 1.0% |
1525221 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 98 | |
19079293 | 1 | 1.0% |
1525221 | 1 | 1.0% |
DPP4i_f_date
Date
MISSING
 
Distinct | 35 |
---|---|
Distinct (%) | 100.0% |
Missing | 65 |
Missing (%) | 65.0% |
Memory size | 932.0 B |
Minimum | 2015-09-03 00:00:00 |
---|---|
Maximum | 2020-03-23 00:00:00 |
DPP4i_f_prcd
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42961500 | |
40239218 | |
19125041 | 3 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 5.4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 40239218 |
4th row | <NA> |
5th row | 40239218 |
Common Values
Value | Count | Frequency (%) |
<NA> | 65 | |
42961500 | 20 | 20.0% |
40239218 | 12 | 12.0% |
19125041 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 65 | |
42961500 | 20 | 20.0% |
40239218 | 12 | 12.0% |
19125041 | 3 | 3.0% |
DPP4i_l_date
Date
MISSING
 
Distinct | 31 |
---|---|
Distinct (%) | 93.9% |
Missing | 67 |
Missing (%) | 67.0% |
Memory size | 932.0 B |
Minimum | 2016-04-07 00:00:00 |
---|---|
Maximum | 2020-05-02 00:00:00 |
DPP4i_l_prcd
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42961500 | |
40239218 | |
42960599 | 4 |
19125041 | 3 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 5.32 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 43013924 |
4th row | <NA> |
5th row | 40239218 |
Common Values
Value | Count | Frequency (%) |
<NA> | 67 | |
42961500 | 14 | 14.0% |
40239218 | 11 | 11.0% |
42960599 | 4 | 4.0% |
19125041 | 3 | 3.0% |
43013924 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 67 | |
42961500 | 14 | 14.0% |
40239218 | 11 | 11.0% |
42960599 | 4 | 4.0% |
19125041 | 3 | 3.0% |
43013924 | 1 | 1.0% |
DPP4i-MET_f_date
Date
MISSING
 
Distinct | 14 |
---|---|
Distinct (%) | 100.0% |
Missing | 86 |
Missing (%) | 86.0% |
Memory size | 932.0 B |
Minimum | 2015-09-17 00:00:00 |
---|---|
Maximum | 2019-10-31 00:00:00 |
DPP4i-MET_f_prcd
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
40164922 | |
42708088 | 3 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.56 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 86 | |
40164922 | 11 | 11.0% |
42708088 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 86 | |
40164922 | 11 | 11.0% |
42708088 | 3 | 3.0% |
DPP4i-MET_l_date
Date
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 90 |
Missing (%) | 90.0% |
Memory size | 932.0 B |
Minimum | 2015-10-21 00:00:00 |
---|---|
Maximum | 2019-02-27 00:00:00 |
DPP4i-MET_l_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
40164922 | 7 |
42708090 | 2 |
42708088 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.4 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 90 | |
40164922 | 7 | 7.0% |
42708090 | 2 | 2.0% |
42708088 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 90 | |
40164922 | 7 | 7.0% |
42708090 | 2 | 2.0% |
42708088 | 1 | 1.0% |
Insul_f_date
Date
MISSING
 
Distinct | 37 |
---|---|
Distinct (%) | 94.9% |
Missing | 61 |
Missing (%) | 61.0% |
Memory size | 932.0 B |
Minimum | 2015-09-03 00:00:00 |
---|---|
Maximum | 2020-04-17 00:00:00 |
Insul_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 17.9% |
Missing | 61 |
Missing (%) | 61.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37877863 |
Minimum | 35779361 |
---|---|
Maximum | 42921713 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 35779361 |
---|---|
5-th percentile | 35779361 |
Q1 | 35781503 |
median | 35781503 |
Q3 | 40755064 |
95-th percentile | 42921713 |
Maximum | 42921713 |
Range | 7142352 |
Interquartile range (IQR) | 4973561 |
Descriptive statistics
Standard deviation | 2779463.9 |
---|---|
Coefficient of variation (CV) | 0.073379639 |
Kurtosis | -1.3899984 |
Mean | 37877863 |
Median Absolute Deviation (MAD) | 2142 |
Skewness | 0.69917761 |
Sum | 1.4772367 × 109 |
Variance | 7.7254198 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
35781503 | 17 | 17.0% |
35779361 | 6 | 6.0% |
40755064 | 5 | 5.0% |
41348914 | 5 | 5.0% |
42921713 | 3 | 3.0% |
36809748 | 2 | 2.0% |
41370419 | 1 | 1.0% |
(Missing) | 61 |
Value | Count | Frequency (%) |
35779361 | 6 | 6.0% |
35781503 | 17 | |
36809748 | 2 | 2.0% |
40755064 | 5 | 5.0% |
41348914 | 5 | 5.0% |
41370419 | 1 | 1.0% |
42921713 | 3 | 3.0% |
Value | Count | Frequency (%) |
42921713 | 3 | 3.0% |
41370419 | 1 | 1.0% |
41348914 | 5 | 5.0% |
40755064 | 5 | 5.0% |
36809748 | 2 | 2.0% |
35781503 | 17 | |
35779361 | 6 | 6.0% |
Insul_l_date
Date
MISSING
 
Distinct | 35 |
---|---|
Distinct (%) | 97.2% |
Missing | 64 |
Missing (%) | 64.0% |
Memory size | 932.0 B |
Minimum | 2015-10-19 00:00:00 |
---|---|
Maximum | 2020-05-04 00:00:00 |
Insul_l_prcd
Real number (ℝ)
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 25.0% |
Missing | 64 |
Missing (%) | 64.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38829948 |
Minimum | 35159339 |
---|---|
Maximum | 42921713 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 35159339 |
---|---|
5-th percentile | 35624356 |
Q1 | 35781503 |
median | 36809748 |
Q3 | 42920572 |
95-th percentile | 42921713 |
Maximum | 42921713 |
Range | 7762374 |
Interquartile range (IQR) | 7139069 |
Descriptive statistics
Standard deviation | 3309731.5 |
---|---|
Coefficient of variation (CV) | 0.085236568 |
Kurtosis | -1.8952458 |
Mean | 38829948 |
Median Absolute Deviation (MAD) | 1650409 |
Skewness | 0.22043654 |
Sum | 1.3978781 × 109 |
Variance | 1.0954322 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
35781503 | 13 | 13.0% |
42921713 | 8 | 8.0% |
40755064 | 4 | 4.0% |
42920572 | 3 | 3.0% |
35779361 | 2 | 2.0% |
35159339 | 2 | 2.0% |
36809748 | 2 | 2.0% |
41348914 | 1 | 1.0% |
40717097 | 1 | 1.0% |
(Missing) | 64 |
Value | Count | Frequency (%) |
35159339 | 2 | 2.0% |
35779361 | 2 | 2.0% |
35781503 | 13 | |
36809748 | 2 | 2.0% |
40717097 | 1 | 1.0% |
40755064 | 4 | 4.0% |
41348914 | 1 | 1.0% |
42920572 | 3 | 3.0% |
42921713 | 8 |
Value | Count | Frequency (%) |
42921713 | 8 | |
42920572 | 3 | 3.0% |
41348914 | 1 | 1.0% |
40755064 | 4 | 4.0% |
40717097 | 1 | 1.0% |
36809748 | 2 | 2.0% |
35781503 | 13 | |
35779361 | 2 | 2.0% |
35159339 | 2 | 2.0% |
RID | SU_f_date | SU_f_prcd | SU_l_date | SU_l_prcd | SU-MET_f_date | SU-MET_f_prcd | SU-MET_l_date | SU-MET_l_prcd | Meg_f_date | Meg_f_prcd | Meg_l_date | Meg_l_prcd | Met_f_date | Met_f_prcd | Met_l_date | Met_l_prcd | TZD_f_date | TZD_f_prcd | TZD_l_date | TZD_l_prcd | DPP4i_f_date | DPP4i_f_prcd | DPP4i_l_date | DPP4i_l_prcd | DPP4i-MET_f_date | DPP4i-MET_f_prcd | DPP4i-MET_l_date | DPP4i-MET_l_prcd | Insul_f_date | Insul_f_prcd | Insul_l_date | Insul_l_prcd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2019-10-16 | 19106521 | 2020-02-14 | 19106521 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2019-09-16 | 40755064 | 2020-02-14 | 40755064 |
1 | 2 | 2015-09-10 | 21133671 | 2019-02-26 | 21133671 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-01-21 | 40164897 | 2018-02-27 | 40164897 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | 3 | 2017-01-31 | 19101729 | 2017-04-04 | 19101729 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-12-29 | 40239218 | 2019-08-30 | 43013924 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-08-19 | 40164946 | 2019-03-13 | 40164929 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-01-14 | 35781503 | 2016-01-14 | 35781503 |
4 | 5 | 2017-12-22 | 19101729 | 2019-05-03 | 21133671 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2015-10-12 | 40164946 | 2019-05-03 | 40164929 | 2017-10-26 | 19079293 | 2018-02-20 | 19079293 | 2015-10-12 | 40239218 | 2017-10-26 | 40239218 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 6 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2015-11-16 | 40239218 | 2020-04-09 | 40239218 | <NA> | <NA> | <NA> | <NA> | 2015-11-09 | 35781503 | 2020-04-09 | 42921713 |
6 | 7 | 2020-03-16 | 19059797 | 2020-04-10 | 19059797 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2020-03-23 | 19125041 | 2020-04-10 | 19125041 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | 8 | 2015-09-25 | 21133671 | 2019-03-13 | 21133671 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2015-09-25 | 1525221 | 2016-03-29 | 1525221 | 2015-09-25 | 42961500 | 2020-03-24 | 42960599 | <NA> | <NA> | <NA> | <NA> | 2016-11-08 | 35779361 | 2020-03-24 | 42920572 |
8 | 9 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | 10 | 2019-09-04 | 21133671 | 2020-01-23 | 21133671 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
RID | SU_f_date | SU_f_prcd | SU_l_date | SU_l_prcd | SU-MET_f_date | SU-MET_f_prcd | SU-MET_l_date | SU-MET_l_prcd | Meg_f_date | Meg_f_prcd | Meg_l_date | Meg_l_prcd | Met_f_date | Met_f_prcd | Met_l_date | Met_l_prcd | TZD_f_date | TZD_f_prcd | TZD_l_date | TZD_l_prcd | DPP4i_f_date | DPP4i_f_prcd | DPP4i_l_date | DPP4i_l_prcd | DPP4i-MET_f_date | DPP4i-MET_f_prcd | DPP4i-MET_l_date | DPP4i-MET_l_prcd | Insul_f_date | Insul_f_prcd | Insul_l_date | Insul_l_prcd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 91 | 2016-03-09 | 1597758 | 2016-04-06 | DGMPD3 | 2015-09-16 | 42953740 | 2016-08-10 | 42953740 | <NA> | <NA> | <NA> | <NA> | 2017-02-20 | 40164929 | 2020-03-16 | 40164929 | <NA> | <NA> | <NA> | <NA> | 2015-09-16 | 40239218 | 2020-03-16 | 42960599 | <NA> | <NA> | <NA> | <NA> | 2017-02-06 | 36809748 | 2020-03-16 | 42921713 |
91 | 92 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
92 | 93 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2020-03-26 | 40164929 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2020-03-26 | 42921713 | 2020-04-25 | 42921713 |
93 | 94 | <NA> | <NA> | <NA> | <NA> | 2019-12-12 | 42953740 | 2020-01-08 | 42953740 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2019-12-12 | 42961500 | 2020-01-08 | 42961500 | <NA> | <NA> | <NA> | <NA> | 2019-12-23 | 35781503 | 2019-12-23 | 35781503 |
94 | 95 | <NA> | <NA> | <NA> | <NA> | 2016-05-23 | 42953698 | 2017-08-18 | 42953740 | <NA> | <NA> | <NA> | <NA> | 2017-08-28 | 40164929 | 2017-08-30 | 40164929 | <NA> | <NA> | <NA> | <NA> | 2017-08-26 | 42961500 | 2017-08-30 | 42961500 | <NA> | <NA> | <NA> | <NA> | 2015-09-07 | 40755064 | 2017-08-30 | 36809748 |
95 | 96 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
96 | 97 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-04-18 | 40164929 | 2016-05-23 | 40164929 | <NA> | <NA> | <NA> | <NA> | 2016-04-18 | 40239218 | 2016-05-23 | 40239218 | <NA> | <NA> | <NA> | <NA> | 2016-04-18 | 35779361 | 2016-05-23 | 35779361 |
97 | 98 | 2019-07-01 | 1597758 | 2020-03-26 | DGMPD4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2019-07-01 | 40164897 | 2020-03-26 | 40164897 | <NA> | <NA> | <NA> | <NA> | 2019-07-01 | 42961500 | 2020-03-26 | 42961500 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
98 | 99 | 2015-09-22 | 1597772 | 2020-04-27 | DGMPD2 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2015-09-22 | 40164929 | 2020-04-27 | 40164929 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
99 | 100 | 2015-12-17 | 19101729 | 2017-02-04 | 19101729 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2016-01-27 | 42960773 | <NA> | <NA> | 2015-09-30 | 42961500 | 2017-02-17 | 40239218 | 2015-10-21 | 40164922 | 2015-10-21 | 40164922 | 2015-09-30 | 35779361 | 2017-02-20 | 35781503 |