Dataset statistics
Number of variables | 33 |
---|---|
Number of observations | 100 |
Missing cells | 1563 |
Missing cells (%) | 47.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.5 KiB |
Average record size in memory | 281.3 B |
Variable types
Text | 17 |
---|---|
Numeric | 7 |
Categorical | 9 |
Dataset
Description | 당뇨 환자의 처방 약물 코드와 최초 처방일과 최종 처방일. sulfonylurea (RxNorm 코드: 1597772, 1597758, 1597773, 19101729, 21133671, 19059797), sulfonylurea+metformin(42953698, 42953917, 42953740), meglitinide(19023425, 19023424, 19023426, 42962884, 19107111, 19107110, 1502829), metformin(19106521, 40164929, 40164946, 40164897, 40164894, 40164925), TZD(1525221, 19079293, 42960773), DPP4i(19125041, 40239218, 43013911, 43013924, 42960599, 42961500), DPP4i-MET(40164922, 42708088,42708090, 42708086), Insulin(46234044, 35782236, 35779361, 41348914, 35786039, 36809748, 42920572, 46234044, 41370419, 41349142, 46234044, 35782557, 35159339, 35781503, 35781503, 46234044, 46234044, 586875, 35781503, 35781503, 46234044, 41348508, 40717097 , 35779506, 40755064, 42921713) |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_pre |
SU-MET_f_prcd is highly imbalanced (64.1%) | Imbalance |
SU-MET_l_prcd is highly imbalanced (71.9%) | Imbalance |
Meg_f_prcd is highly imbalanced (64.3%) | Imbalance |
Meg_l_prcd is highly imbalanced (70.0%) | Imbalance |
TZD_f_prcd is highly imbalanced (70.1%) | Imbalance |
TZD_l_prcd is highly imbalanced (72.1%) | Imbalance |
DPP4i-MET_f_prcd is highly imbalanced (62.6%) | Imbalance |
DPP4i-MET_l_prcd is highly imbalanced (71.9%) | Imbalance |
SU_f_date has 62 (62.0%) missing values | Missing |
SU_f_prcd has 62 (62.0%) missing values | Missing |
SU_l_date has 73 (73.0%) missing values | Missing |
SU-MET_f_date has 90 (90.0%) missing values | Missing |
SU-MET_l_date has 92 (92.0%) missing values | Missing |
Meg_f_date has 85 (85.0%) missing values | Missing |
Meg_l_date has 88 (88.0%) missing values | Missing |
Met_f_date has 34 (34.0%) missing values | Missing |
Met_f_prcd has 34 (34.0%) missing values | Missing |
Met_l_date has 45 (45.0%) missing values | Missing |
Met_l_prcd has 45 (45.0%) missing values | Missing |
TZD_f_date has 90 (90.0%) missing values | Missing |
TZD_l_date has 91 (91.0%) missing values | Missing |
DPP4i_f_date has 54 (54.0%) missing values | Missing |
DPP4i_f_prcd has 54 (54.0%) missing values | Missing |
DPP4i_l_date has 64 (64.0%) missing values | Missing |
DPP4i_l_prcd has 64 (64.0%) missing values | Missing |
DPP4i-MET_f_date has 87 (87.0%) missing values | Missing |
DPP4i-MET_l_date has 91 (91.0%) missing values | Missing |
Insul_f_date has 62 (62.0%) missing values | Missing |
Insul_f_prcd has 62 (62.0%) missing values | Missing |
Insul_l_date has 67 (67.0%) missing values | Missing |
Insul_l_prcd has 67 (67.0%) missing values | Missing |
RID has unique values | Unique |
Reproduction
Analysis started | 2023-10-08 18:57:22.504108 |
---|---|
Analysis finished | 2023-10-08 18:57:23.959959 |
Duration | 1.46 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000001 | 1 | 1.0% |
r0000063 | 1 | 1.0% |
r0000074 | 1 | 1.0% |
r0000073 | 1 | 1.0% |
r0000072 | 1 | 1.0% |
r0000071 | 1 | 1.0% |
r0000070 | 1 | 1.0% |
r0000069 | 1 | 1.0% |
r0000068 | 1 | 1.0% |
r0000067 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | |
Uppercase Letter | 100 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Latin
Value | Count | Frequency (%) |
R | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
SU_f_date
Text
MISSING
 
Distinct | 29 |
---|---|
Distinct (%) | 76.3% |
Missing | 62 |
Missing (%) | 62.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
sep-09 | 3 | 7.9% |
jul-09 | 3 | 7.9% |
mar-14 | 2 | 5.3% |
jan-12 | 2 | 5.3% |
dec-18 | 2 | 5.3% |
mar-19 | 2 | 5.3% |
feb-10 | 2 | 5.3% |
dec-14 | 1 | 2.6% |
dec-15 | 1 | 2.6% |
oct-17 | 1 | 2.6% |
Other values (19) | 19 |
Most occurring characters
Value | Count | Frequency (%) |
- | 38 | |
1 | 31 | 13.6% |
e | 12 | 5.3% |
0 | 11 | 4.8% |
9 | 10 | 4.4% |
u | 10 | 4.4% |
J | 8 | 3.5% |
c | 8 | 3.5% |
p | 7 | 3.1% |
a | 7 | 3.1% |
Other values (22) | 86 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 76 | |
Lowercase Letter | 76 | |
Dash Punctuation | 38 | |
Uppercase Letter | 38 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 12 | |
u | 10 | |
c | 8 | |
p | 7 | |
a | 7 | |
r | 7 | |
g | 4 | 5.3% |
t | 4 | 5.3% |
n | 4 | 5.3% |
l | 4 | 5.3% |
Other values (3) | 9 |
Decimal Number
Value | Count | Frequency (%) |
1 | 31 | |
0 | 11 | 14.5% |
9 | 10 | 13.2% |
8 | 6 | 7.9% |
4 | 6 | 7.9% |
2 | 3 | 3.9% |
5 | 3 | 3.9% |
3 | 3 | 3.9% |
7 | 2 | 2.6% |
6 | 1 | 1.3% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 8 | |
A | 6 | |
S | 5 | |
M | 5 | |
O | 4 | |
D | 4 | |
N | 3 | 7.9% |
F | 3 | 7.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 38 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 114 | |
Latin | 114 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 12 | 10.5% |
u | 10 | 8.8% |
J | 8 | 7.0% |
c | 8 | 7.0% |
p | 7 | 6.1% |
a | 7 | 6.1% |
r | 7 | 6.1% |
A | 6 | 5.3% |
S | 5 | 4.4% |
M | 5 | 4.4% |
Other values (11) | 39 |
Common
Value | Count | Frequency (%) |
- | 38 | |
1 | 31 | |
0 | 11 | 9.6% |
9 | 10 | 8.8% |
8 | 6 | 5.3% |
4 | 6 | 5.3% |
2 | 3 | 2.6% |
5 | 3 | 2.6% |
3 | 3 | 2.6% |
7 | 2 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 228 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 38 | |
1 | 31 | 13.6% |
e | 12 | 5.3% |
0 | 11 | 4.8% |
9 | 10 | 4.4% |
u | 10 | 4.4% |
J | 8 | 3.5% |
c | 8 | 3.5% |
p | 7 | 3.1% |
a | 7 | 3.1% |
Other values (22) | 86 |
SU_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 15.8% |
Missing | 62 |
Missing (%) | 62.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10155377 |
Minimum | 1597758 |
---|---|
Maximum | 21133671 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1597758 |
---|---|
5-th percentile | 1597772 |
Q1 | 1597772 |
median | 1597773 |
Q3 | 19101729 |
95-th percentile | 21133671 |
Maximum | 21133671 |
Range | 19535913 |
Interquartile range (IQR) | 17503957 |
Descriptive statistics
Standard deviation | 9163680.1 |
---|---|
Coefficient of variation (CV) | 0.9023476 |
Kurtosis | -2.0736725 |
Mean | 10155377 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.12532045 |
Sum | 3.8590433 × 108 |
Variance | 8.3973033 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1597772 | 16 | 16.0% |
19101729 | 12 | 12.0% |
21133671 | 5 | 5.0% |
1597773 | 3 | 3.0% |
19059797 | 1 | 1.0% |
1597758 | 1 | 1.0% |
(Missing) | 62 |
Value | Count | Frequency (%) |
1597758 | 1 | 1.0% |
1597772 | 16 | |
1597773 | 3 | 3.0% |
19059797 | 1 | 1.0% |
19101729 | 12 | |
21133671 | 5 | 5.0% |
Value | Count | Frequency (%) |
21133671 | 5 | 5.0% |
19101729 | 12 | |
19059797 | 1 | 1.0% |
1597773 | 3 | 3.0% |
1597772 | 16 | |
1597758 | 1 | 1.0% |
SU_l_date
Text
MISSING
 
Distinct | 21 |
---|---|
Distinct (%) | 77.8% |
Missing | 73 |
Missing (%) | 73.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jul-18 | 3 | 11.1% |
apr-19 | 2 | 7.4% |
mar-17 | 2 | 7.4% |
dec-18 | 2 | 7.4% |
jun-19 | 2 | 7.4% |
mar-19 | 1 | 3.7% |
apr-13 | 1 | 3.7% |
dec-12 | 1 | 3.7% |
feb-19 | 1 | 3.7% |
jun-17 | 1 | 3.7% |
Other values (11) | 11 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 28 | |
- | 27 | |
J | 9 | 5.6% |
u | 8 | 4.9% |
a | 7 | 4.3% |
r | 7 | 4.3% |
9 | 6 | 3.7% |
e | 6 | 3.7% |
8 | 6 | 3.7% |
n | 5 | 3.1% |
Other values (20) | 53 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 54 | |
Lowercase Letter | 54 | |
Dash Punctuation | 27 | |
Uppercase Letter | 27 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 8 | |
a | 7 | |
r | 7 | |
e | 6 | |
n | 5 | |
l | 4 | |
p | 3 | 5.6% |
b | 3 | 5.6% |
v | 3 | 5.6% |
o | 3 | 5.6% |
Other values (3) | 5 |
Decimal Number
Value | Count | Frequency (%) |
1 | 28 | |
9 | 6 | 11.1% |
8 | 6 | 11.1% |
3 | 3 | 5.6% |
7 | 3 | 5.6% |
5 | 2 | 3.7% |
6 | 2 | 3.7% |
4 | 2 | 3.7% |
0 | 1 | 1.9% |
2 | 1 | 1.9% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 9 | |
M | 5 | |
A | 4 | |
F | 3 | 11.1% |
N | 3 | 11.1% |
D | 3 | 11.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 27 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 81 | |
Latin | 81 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
J | 9 | 11.1% |
u | 8 | 9.9% |
a | 7 | 8.6% |
r | 7 | 8.6% |
e | 6 | 7.4% |
n | 5 | 6.2% |
M | 5 | 6.2% |
A | 4 | 4.9% |
l | 4 | 4.9% |
p | 3 | 3.7% |
Other values (9) | 23 |
Common
Value | Count | Frequency (%) |
1 | 28 | |
- | 27 | |
9 | 6 | 7.4% |
8 | 6 | 7.4% |
3 | 3 | 3.7% |
7 | 3 | 3.7% |
5 | 2 | 2.5% |
6 | 2 | 2.5% |
4 | 2 | 2.5% |
0 | 1 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 162 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 28 | |
- | 27 | |
J | 9 | 5.6% |
u | 8 | 4.9% |
a | 7 | 4.3% |
r | 7 | 4.3% |
9 | 6 | 3.7% |
e | 6 | 3.7% |
8 | 6 | 3.7% |
n | 5 | 3.1% |
Other values (20) | 53 |
SU_l_prcd
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
1597772 | |
19101729 | 7 |
21133671 | 7 |
1597773 | 3 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.96 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 19101729 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 73 | |
1597772 | 9 | 9.0% |
19101729 | 7 | 7.0% |
21133671 | 7 | 7.0% |
1597773 | 3 | 3.0% |
19059797 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 73 | |
1597772 | 9 | 9.0% |
19101729 | 7 | 7.0% |
21133671 | 7 | 7.0% |
1597773 | 3 | 3.0% |
19059797 | 1 | 1.0% |
SU-MET_f_date
Text
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 90.0% |
Missing | 90 |
Missing (%) | 90.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
apr-13 | 2 | |
feb-14 | 1 | |
jun-16 | 1 | |
feb-13 | 1 | |
nov-17 | 1 | |
jul-16 | 1 | |
dec-14 | 1 | |
jan-18 | 1 | |
jun-14 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
J | 4 | 6.7% |
u | 3 | 5.0% |
3 | 3 | 5.0% |
n | 3 | 5.0% |
e | 3 | 5.0% |
4 | 3 | 5.0% |
6 | 2 | 3.3% |
p | 2 | 3.3% |
Other values (13) | 17 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20 | |
Lowercase Letter | 20 | |
Dash Punctuation | 10 | |
Uppercase Letter | 10 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 3 | |
n | 3 | |
e | 3 | |
p | 2 | |
b | 2 | |
r | 2 | |
o | 1 | 5.0% |
v | 1 | 5.0% |
l | 1 | 5.0% |
c | 1 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 10 | |
3 | 3 | 15.0% |
4 | 3 | 15.0% |
6 | 2 | 10.0% |
7 | 1 | 5.0% |
8 | 1 | 5.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 4 | |
A | 2 | |
F | 2 | |
N | 1 | 10.0% |
D | 1 | 10.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30 | |
Latin | 30 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
J | 4 | |
u | 3 | |
n | 3 | |
e | 3 | |
p | 2 | 6.7% |
A | 2 | 6.7% |
b | 2 | 6.7% |
F | 2 | 6.7% |
r | 2 | 6.7% |
N | 1 | 3.3% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
3 | 3 | 10.0% |
4 | 3 | 10.0% |
6 | 2 | 6.7% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 60 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
J | 4 | 6.7% |
u | 3 | 5.0% |
3 | 3 | 5.0% |
n | 3 | 5.0% |
e | 3 | 5.0% |
4 | 3 | 5.0% |
6 | 2 | 3.3% |
p | 2 | 3.3% |
Other values (13) | 17 |
SU-MET_f_prcd
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42953917 | 5 |
42953740 | 5 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 90 | |
42953917 | 5 | 5.0% |
42953740 | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 90 | |
42953917 | 5 | 5.0% |
42953740 | 5 | 5.0% |
SU-MET_l_date
Text
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 87.5% |
Missing | 92 |
Missing (%) | 92.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
feb-18 | 2 | |
aug-13 | 1 | |
sep-16 | 1 | |
dec-15 | 1 | |
dec-18 | 1 | |
may-18 | 1 | |
jun-19 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
- | 8 | |
1 | 8 | |
e | 5 | |
8 | 4 | 8.3% |
F | 2 | 4.2% |
b | 2 | 4.2% |
u | 2 | 4.2% |
D | 2 | 4.2% |
c | 2 | 4.2% |
5 | 1 | 2.1% |
Other values (12) | 12 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16 | |
Lowercase Letter | 16 | |
Dash Punctuation | 8 | |
Uppercase Letter | 8 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 5 | |
b | 2 | 12.5% |
u | 2 | 12.5% |
c | 2 | 12.5% |
n | 1 | 6.2% |
y | 1 | 6.2% |
a | 1 | 6.2% |
p | 1 | 6.2% |
g | 1 | 6.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8 | |
8 | 4 | |
5 | 1 | 6.2% |
6 | 1 | 6.2% |
3 | 1 | 6.2% |
9 | 1 | 6.2% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 2 | |
D | 2 | |
J | 1 | |
M | 1 | |
S | 1 | |
A | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24 | |
Latin | 24 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 5 | |
F | 2 | 8.3% |
b | 2 | 8.3% |
u | 2 | 8.3% |
D | 2 | 8.3% |
c | 2 | 8.3% |
n | 1 | 4.2% |
J | 1 | 4.2% |
y | 1 | 4.2% |
a | 1 | 4.2% |
Other values (5) | 5 |
Common
Value | Count | Frequency (%) |
- | 8 | |
1 | 8 | |
8 | 4 | |
5 | 1 | 4.2% |
6 | 1 | 4.2% |
3 | 1 | 4.2% |
9 | 1 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 48 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 8 | |
1 | 8 | |
e | 5 | |
8 | 4 | 8.3% |
F | 2 | 4.2% |
b | 2 | 4.2% |
u | 2 | 4.2% |
D | 2 | 4.2% |
c | 2 | 4.2% |
5 | 1 | 2.1% |
Other values (12) | 12 |
SU-MET_l_prcd
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42953740 | 7 |
42953917 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.32 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 92 | |
42953740 | 7 | 7.0% |
42953917 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 92 | |
42953740 | 7 | 7.0% |
42953917 | 1 | 1.0% |
Meg_f_date
Text
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 66.7% |
Missing | 85 |
Missing (%) | 85.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jul-09 | 3 | |
apr-10 | 2 | |
aug-09 | 2 | |
sep-09 | 2 | |
mar-10 | 1 | 6.7% |
aug-14 | 1 | 6.7% |
feb-12 | 1 | 6.7% |
may-12 | 1 | 6.7% |
nov-10 | 1 | 6.7% |
dec-11 | 1 | 6.7% |
Most occurring characters
Value | Count | Frequency (%) |
- | 15 | |
0 | 11 | |
1 | 9 | 10.0% |
9 | 7 | 7.8% |
u | 6 | 6.7% |
A | 5 | 5.6% |
e | 4 | 4.4% |
p | 4 | 4.4% |
g | 3 | 3.3% |
J | 3 | 3.3% |
Other values (15) | 23 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30 | |
Lowercase Letter | 30 | |
Dash Punctuation | 15 | |
Uppercase Letter | 15 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 6 | |
e | 4 | |
p | 4 | |
g | 3 | |
r | 3 | |
l | 3 | |
a | 2 | 6.7% |
b | 1 | 3.3% |
y | 1 | 3.3% |
o | 1 | 3.3% |
Other values (2) | 2 | 6.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 5 | |
J | 3 | |
S | 2 | 13.3% |
M | 2 | 13.3% |
F | 1 | 6.7% |
N | 1 | 6.7% |
D | 1 | 6.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 11 | |
1 | 9 | |
9 | 7 | |
2 | 2 | 6.7% |
4 | 1 | 3.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 15 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 45 | |
Latin | 45 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 6 | |
A | 5 | |
e | 4 | 8.9% |
p | 4 | 8.9% |
g | 3 | 6.7% |
J | 3 | 6.7% |
r | 3 | 6.7% |
l | 3 | 6.7% |
S | 2 | 4.4% |
M | 2 | 4.4% |
Other values (9) | 10 |
Common
Value | Count | Frequency (%) |
- | 15 | |
0 | 11 | |
1 | 9 | |
9 | 7 | |
2 | 2 | 4.4% |
4 | 1 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 15 | |
0 | 11 | |
1 | 9 | 10.0% |
9 | 7 | 7.8% |
u | 6 | 6.7% |
A | 5 | 5.6% |
e | 4 | 4.4% |
p | 4 | 4.4% |
g | 3 | 3.3% |
J | 3 | 3.3% |
Other values (15) | 23 |
Meg_f_prcd
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
19107111 | 5 |
19107110 | 5 |
42962884 | 2 |
1502829 | 2 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.58 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 85 | |
19107111 | 5 | 5.0% |
19107110 | 5 | 5.0% |
42962884 | 2 | 2.0% |
1502829 | 2 | 2.0% |
19023425 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 85 | |
19107111 | 5 | 5.0% |
19107110 | 5 | 5.0% |
42962884 | 2 | 2.0% |
1502829 | 2 | 2.0% |
19023425 | 1 | 1.0% |
Meg_l_date
Text
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 100.0% |
Missing | 88 |
Missing (%) | 88.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
mar-10 | 1 | |
oct-14 | 1 | |
jun-14 | 1 | |
nov-12 | 1 | |
jul-15 | 1 | |
feb-15 | 1 | |
oct-11 | 1 | |
oct-10 | 1 | |
aug-12 | 1 | |
jun-15 | 1 | |
Other values (2) | 2 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 13 | |
- | 12 | |
J | 4 | 5.6% |
u | 4 | 5.6% |
n | 3 | 4.2% |
2 | 3 | 4.2% |
O | 3 | 4.2% |
c | 3 | 4.2% |
t | 3 | 4.2% |
5 | 3 | 4.2% |
Other values (15) | 21 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24 | |
Lowercase Letter | 24 | |
Dash Punctuation | 12 | |
Uppercase Letter | 12 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 4 | |
n | 3 | |
c | 3 | |
t | 3 | |
b | 2 | |
e | 2 | |
a | 2 | |
g | 1 | 4.2% |
l | 1 | 4.2% |
v | 1 | 4.2% |
Other values (2) | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 13 | |
2 | 3 | 12.5% |
5 | 3 | 12.5% |
4 | 2 | 8.3% |
0 | 2 | 8.3% |
3 | 1 | 4.2% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 4 | |
O | 3 | |
F | 2 | |
A | 1 | 8.3% |
M | 1 | 8.3% |
N | 1 | 8.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 12 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 36 | |
Latin | 36 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
J | 4 | |
u | 4 | |
n | 3 | 8.3% |
O | 3 | 8.3% |
c | 3 | 8.3% |
t | 3 | 8.3% |
b | 2 | 5.6% |
e | 2 | 5.6% |
a | 2 | 5.6% |
F | 2 | 5.6% |
Other values (8) | 8 |
Common
Value | Count | Frequency (%) |
1 | 13 | |
- | 12 | |
2 | 3 | 8.3% |
5 | 3 | 8.3% |
4 | 2 | 5.6% |
0 | 2 | 5.6% |
3 | 1 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 72 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 13 | |
- | 12 | |
J | 4 | 5.6% |
u | 4 | 5.6% |
n | 3 | 4.2% |
2 | 3 | 4.2% |
O | 3 | 4.2% |
c | 3 | 4.2% |
t | 3 | 4.2% |
5 | 3 | 4.2% |
Other values (15) | 21 |
Meg_l_prcd
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
1502829 | 5 |
19107110 | 3 |
19107111 | 2 |
42962884 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.43 |
Min length | 4 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 88 | |
1502829 | 5 | 5.0% |
19107110 | 3 | 3.0% |
19107111 | 2 | 2.0% |
42962884 | 1 | 1.0% |
19023425 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 88 | |
1502829 | 5 | 5.0% |
19107110 | 3 | 3.0% |
19107111 | 2 | 2.0% |
42962884 | 1 | 1.0% |
19023425 | 1 | 1.0% |
Met_f_date
Text
MISSING
 
Distinct | 41 |
---|---|
Distinct (%) | 62.1% |
Missing | 34 |
Missing (%) | 34.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jul-09 | 9 | 13.6% |
sep-09 | 5 | 7.6% |
oct-09 | 5 | 7.6% |
aug-09 | 4 | 6.1% |
apr-10 | 3 | 4.5% |
mar-14 | 2 | 3.0% |
jul-10 | 2 | 3.0% |
jan-12 | 2 | 3.0% |
oct-14 | 2 | 3.0% |
nov-16 | 1 | 1.5% |
Other values (31) | 31 |
Most occurring characters
Value | Count | Frequency (%) |
- | 66 | |
1 | 45 | 11.4% |
0 | 35 | 8.8% |
9 | 26 | 6.6% |
u | 25 | 6.3% |
J | 23 | 5.8% |
l | 15 | 3.8% |
e | 14 | 3.5% |
p | 13 | 3.3% |
A | 13 | 3.3% |
Other values (22) | 121 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 132 | |
Lowercase Letter | 132 | |
Dash Punctuation | 66 | |
Uppercase Letter | 66 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 25 | |
l | 15 | |
e | 14 | |
p | 13 | |
c | 13 | |
a | 10 | 7.6% |
t | 9 | 6.8% |
r | 9 | 6.8% |
n | 8 | 6.1% |
g | 7 | 5.3% |
Other values (4) | 9 | 6.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 45 | |
0 | 35 | |
9 | 26 | |
4 | 10 | 7.6% |
2 | 6 | 4.5% |
5 | 3 | 2.3% |
6 | 3 | 2.3% |
7 | 2 | 1.5% |
8 | 2 | 1.5% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 23 | |
A | 13 | |
O | 9 | 13.6% |
S | 7 | 10.6% |
M | 5 | 7.6% |
D | 4 | 6.1% |
F | 3 | 4.5% |
N | 2 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 66 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 198 | |
Latin | 198 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 25 | |
J | 23 | |
l | 15 | 7.6% |
e | 14 | 7.1% |
p | 13 | 6.6% |
A | 13 | 6.6% |
c | 13 | 6.6% |
a | 10 | 5.1% |
t | 9 | 4.5% |
r | 9 | 4.5% |
Other values (12) | 54 |
Common
Value | Count | Frequency (%) |
- | 66 | |
1 | 45 | |
0 | 35 | |
9 | 26 | 13.1% |
4 | 10 | 5.1% |
2 | 6 | 3.0% |
5 | 3 | 1.5% |
6 | 3 | 1.5% |
7 | 2 | 1.0% |
8 | 2 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 396 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 66 | |
1 | 45 | 11.4% |
0 | 35 | 8.8% |
9 | 26 | 6.6% |
u | 25 | 6.3% |
J | 23 | 5.8% |
l | 15 | 3.8% |
e | 14 | 3.5% |
p | 13 | 3.3% |
A | 13 | 3.3% |
Other values (22) | 121 |
Met_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 9.1% |
Missing | 34 |
Missing (%) | 34.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37293325 |
Minimum | 19106521 |
---|---|
Maximum | 40164946 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19106521 |
---|---|
5-th percentile | 19106521 |
Q1 | 40164897 |
median | 40164929 |
Q3 | 40164929 |
95-th percentile | 40164946 |
Maximum | 40164946 |
Range | 21058425 |
Interquartile range (IQR) | 32 |
Descriptive statistics
Standard deviation | 7282081.1 |
---|---|
Coefficient of variation (CV) | 0.195265 |
Kurtosis | 2.7875244 |
Mean | 37293325 |
Median Absolute Deviation (MAD) | 17 |
Skewness | -2.1688585 |
Sum | 2.4613595 × 109 |
Variance | 5.3028705 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40164929 | 20 | |
40164946 | 14 | |
40164925 | 12 | 12.0% |
40164897 | 10 | 10.0% |
19106521 | 9 | 9.0% |
40164894 | 1 | 1.0% |
(Missing) | 34 |
Value | Count | Frequency (%) |
19106521 | 9 | |
40164894 | 1 | 1.0% |
40164897 | 10 | |
40164925 | 12 | |
40164929 | 20 | |
40164946 | 14 |
Value | Count | Frequency (%) |
40164946 | 14 | |
40164929 | 20 | |
40164925 | 12 | |
40164897 | 10 | |
40164894 | 1 | 1.0% |
19106521 | 9 |
Met_l_date
Text
MISSING
 
Distinct | 37 |
---|---|
Distinct (%) | 67.3% |
Missing | 45 |
Missing (%) | 45.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
apr-19 | 5 | 9.1% |
may-19 | 4 | 7.3% |
feb-15 | 3 | 5.5% |
mar-12 | 3 | 5.5% |
aug-15 | 3 | 5.5% |
jun-19 | 3 | 5.5% |
feb-18 | 2 | 3.6% |
feb-19 | 2 | 3.6% |
mar-19 | 2 | 3.6% |
oct-13 | 1 | 1.8% |
Other values (27) | 27 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 61 | |
- | 55 | |
9 | 17 | 5.2% |
e | 15 | 4.5% |
u | 13 | 3.9% |
a | 13 | 3.9% |
J | 12 | 3.6% |
r | 12 | 3.6% |
A | 11 | 3.3% |
p | 10 | 3.0% |
Other values (23) | 111 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 110 | |
Lowercase Letter | 110 | |
Dash Punctuation | 55 | |
Uppercase Letter | 55 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 15 | |
u | 13 | |
a | 13 | |
r | 12 | |
p | 10 | |
n | 9 | |
b | 8 | |
c | 7 | |
y | 5 | 4.5% |
g | 4 | 3.6% |
Other values (4) | 14 |
Decimal Number
Value | Count | Frequency (%) |
1 | 61 | |
9 | 17 | 15.5% |
5 | 9 | 8.2% |
2 | 4 | 3.6% |
8 | 4 | 3.6% |
0 | 3 | 2.7% |
7 | 3 | 2.7% |
6 | 3 | 2.7% |
4 | 3 | 2.7% |
3 | 3 | 2.7% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 12 | |
A | 11 | |
M | 10 | |
F | 8 | |
D | 4 | 7.3% |
N | 4 | 7.3% |
O | 3 | 5.5% |
S | 3 | 5.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 55 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 165 | |
Latin | 165 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 15 | 9.1% |
u | 13 | 7.9% |
a | 13 | 7.9% |
J | 12 | 7.3% |
r | 12 | 7.3% |
A | 11 | 6.7% |
p | 10 | 6.1% |
M | 10 | 6.1% |
n | 9 | 5.5% |
F | 8 | 4.8% |
Other values (12) | 52 |
Common
Value | Count | Frequency (%) |
1 | 61 | |
- | 55 | |
9 | 17 | 10.3% |
5 | 9 | 5.5% |
2 | 4 | 2.4% |
8 | 4 | 2.4% |
0 | 3 | 1.8% |
7 | 3 | 1.8% |
6 | 3 | 1.8% |
4 | 3 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 330 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 61 | |
- | 55 | |
9 | 17 | 5.2% |
e | 15 | 4.5% |
u | 13 | 3.9% |
a | 13 | 3.9% |
J | 12 | 3.6% |
r | 12 | 3.6% |
A | 11 | 3.3% |
p | 10 | 3.0% |
Other values (23) | 111 |
Met_l_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 10.9% |
Missing | 45 |
Missing (%) | 45.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38633410 |
Minimum | 19106521 |
---|---|
Maximum | 40164946 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19106521 |
---|---|
5-th percentile | 19106521 |
Q1 | 40164925 |
median | 40164929 |
Q3 | 40164946 |
95-th percentile | 40164946 |
Maximum | 40164946 |
Range | 21058425 |
Interquartile range (IQR) | 21 |
Descriptive statistics
Standard deviation | 5519025.8 |
---|---|
Coefficient of variation (CV) | 0.1428563 |
Kurtosis | 9.8044907 |
Mean | 38633410 |
Median Absolute Deviation (MAD) | 17 |
Skewness | -3.3836476 |
Sum | 2.1248375 × 109 |
Variance | 3.0459646 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40164929 | 20 | |
40164946 | 19 | |
40164897 | 6 | 6.0% |
40164925 | 5 | 5.0% |
19106521 | 4 | 4.0% |
40164894 | 1 | 1.0% |
(Missing) | 45 |
Value | Count | Frequency (%) |
19106521 | 4 | 4.0% |
40164894 | 1 | 1.0% |
40164897 | 6 | 6.0% |
40164925 | 5 | 5.0% |
40164929 | 20 | |
40164946 | 19 |
Value | Count | Frequency (%) |
40164946 | 19 | |
40164929 | 20 | |
40164925 | 5 | 5.0% |
40164897 | 6 | 6.0% |
40164894 | 1 | 1.0% |
19106521 | 4 | 4.0% |
TZD_f_date
Text
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 90.0% |
Missing | 90 |
Missing (%) | 90.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jun-16 | 2 | |
aug-17 | 1 | |
mar-19 | 1 | |
dec-17 | 1 | |
jan-14 | 1 | |
nov-17 | 1 | |
sep-15 | 1 | |
mar-16 | 1 | |
feb-12 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
J | 3 | 5.0% |
a | 3 | 5.0% |
n | 3 | 5.0% |
6 | 3 | 5.0% |
e | 3 | 5.0% |
u | 3 | 5.0% |
7 | 3 | 5.0% |
r | 2 | 3.3% |
Other values (16) | 17 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20 | |
Lowercase Letter | 20 | |
Dash Punctuation | 10 | |
Uppercase Letter | 10 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 3 | |
n | 3 | |
e | 3 | |
u | 3 | |
r | 2 | |
o | 1 | 5.0% |
b | 1 | 5.0% |
p | 1 | 5.0% |
v | 1 | 5.0% |
c | 1 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 10 | |
6 | 3 | 15.0% |
7 | 3 | 15.0% |
5 | 1 | 5.0% |
4 | 1 | 5.0% |
9 | 1 | 5.0% |
2 | 1 | 5.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 3 | |
M | 2 | |
F | 1 | 10.0% |
S | 1 | 10.0% |
D | 1 | 10.0% |
N | 1 | 10.0% |
A | 1 | 10.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30 | |
Latin | 30 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
J | 3 | 10.0% |
a | 3 | 10.0% |
n | 3 | 10.0% |
e | 3 | 10.0% |
u | 3 | 10.0% |
r | 2 | 6.7% |
M | 2 | 6.7% |
o | 1 | 3.3% |
F | 1 | 3.3% |
b | 1 | 3.3% |
Other values (8) | 8 |
Common
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
6 | 3 | 10.0% |
7 | 3 | 10.0% |
5 | 1 | 3.3% |
4 | 1 | 3.3% |
9 | 1 | 3.3% |
2 | 1 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 60 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 10 | |
1 | 10 | |
J | 3 | 5.0% |
a | 3 | 5.0% |
n | 3 | 5.0% |
6 | 3 | 5.0% |
e | 3 | 5.0% |
u | 3 | 5.0% |
7 | 3 | 5.0% |
r | 2 | 3.3% |
Other values (16) | 17 |
TZD_f_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42960773 | 6 |
1525221 | 3 |
19079293 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.37 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | 42960773 |
Common Values
Value | Count | Frequency (%) |
<NA> | 90 | |
42960773 | 6 | 6.0% |
1525221 | 3 | 3.0% |
19079293 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 90 | |
42960773 | 6 | 6.0% |
1525221 | 3 | 3.0% |
19079293 | 1 | 1.0% |
TZD_l_date
Text
MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 88.9% |
Missing | 91 |
Missing (%) | 91.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
apr-19 | 2 | |
mar-19 | 1 | |
may-19 | 1 | |
may-18 | 1 | |
jun-19 | 1 | |
feb-18 | 1 | |
mar-16 | 1 | |
aug-14 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
9 | 5 | |
r | 4 | 7.4% |
M | 4 | 7.4% |
a | 4 | 7.4% |
A | 3 | 5.6% |
p | 2 | 3.7% |
u | 2 | 3.7% |
8 | 2 | 3.7% |
Other values (9) | 10 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 18 | |
Lowercase Letter | 18 | |
Dash Punctuation | 9 | |
Uppercase Letter | 9 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
r | 4 | |
a | 4 | |
p | 2 | |
u | 2 | |
y | 2 | |
n | 1 | 5.6% |
e | 1 | 5.6% |
b | 1 | 5.6% |
g | 1 | 5.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
9 | 5 | |
8 | 2 | 11.1% |
6 | 1 | 5.6% |
4 | 1 | 5.6% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 4 | |
A | 3 | |
J | 1 | 11.1% |
F | 1 | 11.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 27 | |
Latin | 27 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
r | 4 | |
M | 4 | |
a | 4 | |
A | 3 | |
p | 2 | |
u | 2 | |
y | 2 | |
J | 1 | 3.7% |
n | 1 | 3.7% |
F | 1 | 3.7% |
Other values (3) | 3 |
Common
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
9 | 5 | |
8 | 2 | 7.4% |
6 | 1 | 3.7% |
4 | 1 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 54 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
9 | 5 | |
r | 4 | 7.4% |
M | 4 | 7.4% |
a | 4 | 7.4% |
A | 3 | 5.6% |
p | 2 | 3.7% |
u | 2 | 3.7% |
8 | 2 | 3.7% |
Other values (9) | 10 |
TZD_l_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42960773 | 5 |
1525221 | 3 |
19079293 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.33 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | 19079293 |
Common Values
Value | Count | Frequency (%) |
<NA> | 91 | |
42960773 | 5 | 5.0% |
1525221 | 3 | 3.0% |
19079293 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 91 | |
42960773 | 5 | 5.0% |
1525221 | 3 | 3.0% |
19079293 | 1 | 1.0% |
DPP4i_f_date
Text
MISSING
 
Distinct | 39 |
---|---|
Distinct (%) | 84.8% |
Missing | 54 |
Missing (%) | 54.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jul-09 | 3 | 6.5% |
oct-09 | 2 | 4.3% |
may-19 | 2 | 4.3% |
dec-18 | 2 | 4.3% |
aug-14 | 2 | 4.3% |
dec-13 | 2 | 4.3% |
sep-09 | 1 | 2.2% |
jan-12 | 1 | 2.2% |
mar-17 | 1 | 2.2% |
jun-14 | 1 | 2.2% |
Other values (29) | 29 |
Most occurring characters
Value | Count | Frequency (%) |
- | 46 | |
1 | 43 | |
u | 15 | 5.4% |
c | 15 | 5.4% |
e | 14 | 5.1% |
J | 12 | 4.3% |
0 | 9 | 3.3% |
9 | 8 | 2.9% |
4 | 8 | 2.9% |
O | 8 | 2.9% |
Other values (23) | 98 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 92 | |
Lowercase Letter | 92 | |
Dash Punctuation | 46 | |
Uppercase Letter | 46 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 15 | |
c | 15 | |
e | 14 | |
t | 8 | |
n | 6 | 6.5% |
l | 6 | 6.5% |
a | 5 | 5.4% |
p | 4 | 4.3% |
g | 4 | 4.3% |
b | 4 | 4.3% |
Other values (4) | 11 |
Decimal Number
Value | Count | Frequency (%) |
1 | 43 | |
0 | 9 | 9.8% |
9 | 8 | 8.7% |
4 | 8 | 8.7% |
8 | 6 | 6.5% |
6 | 5 | 5.4% |
3 | 4 | 4.3% |
7 | 4 | 4.3% |
5 | 4 | 4.3% |
2 | 1 | 1.1% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 12 | |
O | 8 | |
D | 7 | |
A | 5 | |
M | 4 | 8.7% |
F | 4 | 8.7% |
S | 3 | 6.5% |
N | 3 | 6.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 46 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 138 | |
Latin | 138 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 15 | 10.9% |
c | 15 | 10.9% |
e | 14 | 10.1% |
J | 12 | 8.7% |
O | 8 | 5.8% |
t | 8 | 5.8% |
D | 7 | 5.1% |
n | 6 | 4.3% |
l | 6 | 4.3% |
a | 5 | 3.6% |
Other values (12) | 42 |
Common
Value | Count | Frequency (%) |
- | 46 | |
1 | 43 | |
0 | 9 | 6.5% |
9 | 8 | 5.8% |
4 | 8 | 5.8% |
8 | 6 | 4.3% |
6 | 5 | 3.6% |
3 | 4 | 2.9% |
7 | 4 | 2.9% |
5 | 4 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 276 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 46 | |
1 | 43 | |
u | 15 | 5.4% |
c | 15 | 5.4% |
e | 14 | 5.1% |
J | 12 | 4.3% |
0 | 9 | 3.3% |
9 | 8 | 2.9% |
4 | 8 | 2.9% |
O | 8 | 2.9% |
Other values (23) | 98 |
DPP4i_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 13.0% |
Missing | 54 |
Missing (%) | 54.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 34428458 |
Minimum | 19125041 |
---|---|
Maximum | 43013924 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19125041 |
---|---|
5-th percentile | 19125041 |
Q1 | 19125041 |
median | 40239218 |
Q3 | 42961500 |
95-th percentile | 43013924 |
Maximum | 43013924 |
Range | 23888883 |
Interquartile range (IQR) | 23836459 |
Descriptive statistics
Standard deviation | 10821404 |
---|---|
Coefficient of variation (CV) | 0.31431568 |
Kurtosis | -1.4813267 |
Mean | 34428458 |
Median Absolute Deviation (MAD) | 2748487.5 |
Skewness | -0.73212073 |
Sum | 1.5837091 × 109 |
Variance | 1.1710279 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19125041 | 15 | 15.0% |
40239218 | 13 | 13.0% |
42961500 | 7 | 7.0% |
43013924 | 4 | 4.0% |
43013911 | 4 | 4.0% |
42960599 | 3 | 3.0% |
(Missing) | 54 |
Value | Count | Frequency (%) |
19125041 | 15 | |
40239218 | 13 | |
42960599 | 3 | 3.0% |
42961500 | 7 | |
43013911 | 4 | 4.0% |
43013924 | 4 | 4.0% |
Value | Count | Frequency (%) |
43013924 | 4 | 4.0% |
43013911 | 4 | 4.0% |
42961500 | 7 | |
42960599 | 3 | 3.0% |
40239218 | 13 | |
19125041 | 15 |
DPP4i_l_date
Text
MISSING
 
Distinct | 24 |
---|---|
Distinct (%) | 66.7% |
Missing | 64 |
Missing (%) | 64.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
apr-19 | 4 | 11.1% |
mar-19 | 4 | 11.1% |
dec-18 | 3 | 8.3% |
aug-15 | 2 | 5.6% |
aug-18 | 2 | 5.6% |
feb-19 | 2 | 5.6% |
dec-15 | 2 | 5.6% |
jul-18 | 1 | 2.8% |
dec-14 | 1 | 2.8% |
may-17 | 1 | 2.8% |
Other values (14) | 14 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 37 | |
- | 36 | |
9 | 12 | 5.6% |
e | 11 | 5.1% |
r | 10 | 4.6% |
c | 9 | 4.2% |
u | 8 | 3.7% |
A | 8 | 3.7% |
a | 8 | 3.7% |
M | 8 | 3.7% |
Other values (21) | 69 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 72 | |
Lowercase Letter | 72 | |
Dash Punctuation | 36 | |
Uppercase Letter | 36 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 11 | |
r | 10 | |
c | 9 | |
u | 8 | |
a | 8 | |
p | 5 | |
g | 4 | 5.6% |
v | 3 | 4.2% |
o | 3 | 4.2% |
l | 3 | 4.2% |
Other values (4) | 8 |
Decimal Number
Value | Count | Frequency (%) |
1 | 37 | |
9 | 12 | 16.7% |
8 | 7 | 9.7% |
5 | 6 | 8.3% |
7 | 5 | 6.9% |
4 | 3 | 4.2% |
2 | 1 | 1.4% |
0 | 1 | 1.4% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 8 | |
M | 8 | |
D | 7 | |
J | 4 | |
N | 3 | 8.3% |
F | 3 | 8.3% |
O | 2 | 5.6% |
S | 1 | 2.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 36 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 108 | |
Latin | 108 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 11 | 10.2% |
r | 10 | 9.3% |
c | 9 | 8.3% |
u | 8 | 7.4% |
A | 8 | 7.4% |
a | 8 | 7.4% |
M | 8 | 7.4% |
D | 7 | 6.5% |
p | 5 | 4.6% |
g | 4 | 3.7% |
Other values (12) | 30 |
Common
Value | Count | Frequency (%) |
1 | 37 | |
- | 36 | |
9 | 12 | 11.1% |
8 | 7 | 6.5% |
5 | 6 | 5.6% |
7 | 5 | 4.6% |
4 | 3 | 2.8% |
2 | 1 | 0.9% |
0 | 1 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 216 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 37 | |
- | 36 | |
9 | 12 | 5.6% |
e | 11 | 5.1% |
r | 10 | 4.6% |
c | 9 | 4.2% |
u | 8 | 3.7% |
A | 8 | 3.7% |
a | 8 | 3.7% |
M | 8 | 3.7% |
Other values (21) | 69 |
DPP4i_l_prcd
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 16.7% |
Missing | 64 |
Missing (%) | 64.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 35007634 |
Minimum | 19125041 |
---|---|
Maximum | 43013924 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 19125041 |
---|---|
5-th percentile | 19125041 |
Q1 | 19125041 |
median | 40239218 |
Q3 | 42961500 |
95-th percentile | 43013924 |
Maximum | 43013924 |
Range | 23888883 |
Interquartile range (IQR) | 23836459 |
Descriptive statistics
Standard deviation | 10742649 |
---|---|
Coefficient of variation (CV) | 0.3068659 |
Kurtosis | -1.3105721 |
Mean | 35007634 |
Median Absolute Deviation (MAD) | 2748487.5 |
Skewness | -0.84582401 |
Sum | 1.2602748 × 109 |
Variance | 1.1540451 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19125041 | 11 | 11.0% |
40239218 | 9 | 9.0% |
42960599 | 5 | 5.0% |
42961500 | 4 | 4.0% |
43013924 | 4 | 4.0% |
43013911 | 3 | 3.0% |
(Missing) | 64 |
Value | Count | Frequency (%) |
19125041 | 11 | |
40239218 | 9 | |
42960599 | 5 | |
42961500 | 4 | 4.0% |
43013911 | 3 | 3.0% |
43013924 | 4 | 4.0% |
Value | Count | Frequency (%) |
43013924 | 4 | 4.0% |
43013911 | 3 | 3.0% |
42961500 | 4 | 4.0% |
42960599 | 5 | |
40239218 | 9 | |
19125041 | 11 |
DPP4i-MET_f_date
Text
MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 84.6% |
Missing | 87 |
Missing (%) | 87.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
dec-16 | 2 | |
oct-15 | 2 | |
feb-17 | 1 | |
mar-19 | 1 | |
nov-12 | 1 | |
nov-18 | 1 | |
jan-17 | 1 | |
jul-18 | 1 | |
mar-17 | 1 | |
mar-11 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 14 | |
- | 13 | |
c | 4 | 5.1% |
e | 4 | 5.1% |
a | 4 | 5.1% |
M | 3 | 3.8% |
r | 3 | 3.8% |
7 | 3 | 3.8% |
J | 2 | 2.6% |
8 | 2 | 2.6% |
Other values (15) | 26 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26 | |
Lowercase Letter | 26 | |
Dash Punctuation | 13 | |
Uppercase Letter | 13 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
c | 4 | |
e | 4 | |
a | 4 | |
r | 3 | |
v | 2 | |
o | 2 | |
b | 2 | |
t | 2 | |
n | 1 | 3.8% |
u | 1 | 3.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 14 | |
7 | 3 | 11.5% |
8 | 2 | 7.7% |
2 | 2 | 7.7% |
5 | 2 | 7.7% |
6 | 2 | 7.7% |
9 | 1 | 3.8% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 3 | |
J | 2 | |
N | 2 | |
D | 2 | |
F | 2 | |
O | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 13 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 39 | |
Latin | 39 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
c | 4 | 10.3% |
e | 4 | 10.3% |
a | 4 | 10.3% |
M | 3 | 7.7% |
r | 3 | 7.7% |
J | 2 | 5.1% |
v | 2 | 5.1% |
o | 2 | 5.1% |
N | 2 | 5.1% |
D | 2 | 5.1% |
Other values (7) | 11 |
Common
Value | Count | Frequency (%) |
1 | 14 | |
- | 13 | |
7 | 3 | 7.7% |
8 | 2 | 5.1% |
2 | 2 | 5.1% |
5 | 2 | 5.1% |
6 | 2 | 5.1% |
9 | 1 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 78 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 14 | |
- | 13 | |
c | 4 | 5.1% |
e | 4 | 5.1% |
a | 4 | 5.1% |
M | 3 | 3.8% |
r | 3 | 3.8% |
7 | 3 | 3.8% |
J | 2 | 2.6% |
8 | 2 | 2.6% |
Other values (15) | 26 |
DPP4i-MET_f_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
40164922 | 6 |
42708090 | 5 |
42708088 | 2 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.52 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 42708090 |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 87 | |
40164922 | 6 | 6.0% |
42708090 | 5 | 5.0% |
42708088 | 2 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 87 | |
40164922 | 6 | 6.0% |
42708090 | 5 | 5.0% |
42708088 | 2 | 2.0% |
DPP4i-MET_l_date
Text
MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 88.9% |
Missing | 91 |
Missing (%) | 91.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
jun-19 | 2 | |
nov-13 | 1 | |
mar-19 | 1 | |
nov-18 | 1 | |
apr-19 | 1 | |
jul-18 | 1 | |
oct-16 | 1 | |
aug-12 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
u | 4 | 7.4% |
9 | 4 | 7.4% |
J | 3 | 5.6% |
A | 2 | 3.7% |
8 | 2 | 3.7% |
r | 2 | 3.7% |
v | 2 | 3.7% |
o | 2 | 3.7% |
Other values (13) | 15 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 18 | |
Lowercase Letter | 18 | |
Dash Punctuation | 9 | |
Uppercase Letter | 9 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 4 | |
r | 2 | |
v | 2 | |
o | 2 | |
n | 2 | |
g | 1 | 5.6% |
t | 1 | 5.6% |
c | 1 | 5.6% |
a | 1 | 5.6% |
l | 1 | 5.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
9 | 4 | |
8 | 2 | 11.1% |
6 | 1 | 5.6% |
3 | 1 | 5.6% |
2 | 1 | 5.6% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 3 | |
A | 2 | |
N | 2 | |
O | 1 | 11.1% |
M | 1 | 11.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 27 | |
Latin | 27 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 4 | |
J | 3 | |
A | 2 | 7.4% |
r | 2 | 7.4% |
v | 2 | 7.4% |
o | 2 | 7.4% |
N | 2 | 7.4% |
n | 2 | 7.4% |
g | 1 | 3.7% |
O | 1 | 3.7% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
9 | 4 | |
8 | 2 | 7.4% |
6 | 1 | 3.7% |
3 | 1 | 3.7% |
2 | 1 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 54 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 9 | |
1 | 9 | |
u | 4 | 7.4% |
9 | 4 | 7.4% |
J | 3 | 5.6% |
A | 2 | 3.7% |
8 | 2 | 3.7% |
r | 2 | 3.7% |
v | 2 | 3.7% |
o | 2 | 3.7% |
Other values (13) | 15 |
DPP4i-MET_l_prcd
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
42708090 | 4 |
40164922 | 4 |
42708088 | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.36 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 42708090 |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 91 | |
42708090 | 4 | 4.0% |
40164922 | 4 | 4.0% |
42708088 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 91 | |
42708090 | 4 | 4.0% |
40164922 | 4 | 4.0% |
42708088 | 1 | 1.0% |
Insul_f_date
Text
MISSING
 
Distinct | 33 |
---|---|
Distinct (%) | 86.8% |
Missing | 62 |
Missing (%) | 62.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
oct-11 | 2 | 5.3% |
aug-09 | 2 | 5.3% |
apr-10 | 2 | 5.3% |
may-11 | 2 | 5.3% |
jul-09 | 2 | 5.3% |
mar-16 | 1 | 2.6% |
aug-13 | 1 | 2.6% |
jun-17 | 1 | 2.6% |
may-15 | 1 | 2.6% |
oct-13 | 1 | 2.6% |
Other values (23) | 23 |
Most occurring characters
Value | Count | Frequency (%) |
- | 38 | |
1 | 34 | |
0 | 13 | 5.7% |
u | 12 | 5.3% |
9 | 9 | 3.9% |
e | 8 | 3.5% |
J | 8 | 3.5% |
c | 8 | 3.5% |
A | 8 | 3.5% |
a | 8 | 3.5% |
Other values (23) | 82 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 76 | |
Lowercase Letter | 76 | |
Dash Punctuation | 38 | |
Uppercase Letter | 38 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 12 | |
e | 8 | |
c | 8 | |
a | 8 | |
y | 6 | |
g | 5 | |
l | 5 | |
p | 5 | |
t | 5 | |
r | 4 | 5.3% |
Other values (4) | 10 |
Decimal Number
Value | Count | Frequency (%) |
1 | 34 | |
0 | 13 | 17.1% |
9 | 9 | 11.8% |
8 | 4 | 5.3% |
4 | 4 | 5.3% |
3 | 4 | 5.3% |
7 | 3 | 3.9% |
6 | 2 | 2.6% |
5 | 2 | 2.6% |
2 | 1 | 1.3% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 8 | |
A | 8 | |
M | 7 | |
O | 5 | |
D | 3 | 7.9% |
F | 3 | 7.9% |
S | 2 | 5.3% |
N | 2 | 5.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 38 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 114 | |
Latin | 114 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 12 | 10.5% |
e | 8 | 7.0% |
J | 8 | 7.0% |
c | 8 | 7.0% |
A | 8 | 7.0% |
a | 8 | 7.0% |
M | 7 | 6.1% |
y | 6 | 5.3% |
O | 5 | 4.4% |
g | 5 | 4.4% |
Other values (12) | 39 |
Common
Value | Count | Frequency (%) |
- | 38 | |
1 | 34 | |
0 | 13 | 11.4% |
9 | 9 | 7.9% |
8 | 4 | 3.5% |
4 | 4 | 3.5% |
3 | 4 | 3.5% |
7 | 3 | 2.6% |
6 | 2 | 1.8% |
5 | 2 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 228 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 38 | |
1 | 34 | |
0 | 13 | 5.7% |
u | 12 | 5.3% |
9 | 9 | 3.9% |
e | 8 | 3.5% |
J | 8 | 3.5% |
c | 8 | 3.5% |
A | 8 | 3.5% |
a | 8 | 3.5% |
Other values (23) | 82 |
Insul_f_prcd
Real number (ℝ)
MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 21.1% |
Missing | 62 |
Missing (%) | 62.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36295127 |
Minimum | 586875 |
---|---|
Maximum | 41348914 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 586875 |
---|---|
5-th percentile | 35686358 |
Q1 | 35779361 |
median | 35781503 |
Q3 | 39768735 |
95-th percentile | 41348914 |
Maximum | 41348914 |
Range | 40762039 |
Interquartile range (IQR) | 3989374 |
Descriptive statistics
Standard deviation | 6401141 |
---|---|
Coefficient of variation (CV) | 0.17636365 |
Kurtosis | 27.628311 |
Mean | 36295127 |
Median Absolute Deviation (MAD) | 2142 |
Skewness | -4.8300189 |
Sum | 1.3792148 × 109 |
Variance | 4.0974605 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
35781503 | 13 | 13.0% |
35779361 | 10 | 10.0% |
41348914 | 6 | 6.0% |
40755064 | 4 | 4.0% |
36809748 | 2 | 2.0% |
35782236 | 1 | 1.0% |
586875 | 1 | 1.0% |
35159339 | 1 | 1.0% |
(Missing) | 62 |
Value | Count | Frequency (%) |
586875 | 1 | 1.0% |
35159339 | 1 | 1.0% |
35779361 | 10 | |
35781503 | 13 | |
35782236 | 1 | 1.0% |
36809748 | 2 | 2.0% |
40755064 | 4 | 4.0% |
41348914 | 6 |
Value | Count | Frequency (%) |
41348914 | 6 | |
40755064 | 4 | 4.0% |
36809748 | 2 | 2.0% |
35782236 | 1 | 1.0% |
35781503 | 13 | |
35779361 | 10 | |
35159339 | 1 | 1.0% |
586875 | 1 | 1.0% |
Insul_l_date
Text
MISSING
 
Distinct | 24 |
---|---|
Distinct (%) | 72.7% |
Missing | 67 |
Missing (%) | 67.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
may-19 | 3 | 9.1% |
jun-19 | 3 | 9.1% |
apr-19 | 3 | 9.1% |
aug-15 | 2 | 6.1% |
oct-13 | 2 | 6.1% |
jul-18 | 2 | 6.1% |
dec-16 | 1 | 3.0% |
nov-10 | 1 | 3.0% |
jan-15 | 1 | 3.0% |
feb-19 | 1 | 3.0% |
Other values (14) | 14 |
Most occurring characters
Value | Count | Frequency (%) |
- | 33 | |
1 | 33 | |
9 | 11 | 5.6% |
u | 9 | 4.5% |
A | 9 | 4.5% |
J | 8 | 4.0% |
a | 8 | 4.0% |
r | 8 | 4.0% |
p | 7 | 3.5% |
M | 6 | 3.0% |
Other values (23) | 66 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 66 | |
Lowercase Letter | 66 | |
Dash Punctuation | 33 | |
Uppercase Letter | 33 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 9 | |
a | 8 | |
r | 8 | |
p | 7 | |
n | 5 | |
c | 5 | |
e | 5 | |
y | 4 | |
v | 3 | 4.5% |
g | 3 | 4.5% |
Other values (4) | 9 |
Decimal Number
Value | Count | Frequency (%) |
1 | 33 | |
9 | 11 | 16.7% |
5 | 5 | 7.6% |
0 | 4 | 6.1% |
2 | 3 | 4.5% |
8 | 3 | 4.5% |
3 | 3 | 4.5% |
7 | 2 | 3.0% |
6 | 1 | 1.5% |
4 | 1 | 1.5% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9 | |
J | 8 | |
M | 6 | |
D | 3 | 9.1% |
N | 3 | 9.1% |
O | 2 | 6.1% |
S | 1 | 3.0% |
F | 1 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 33 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 99 | |
Latin | 99 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 9 | 9.1% |
A | 9 | 9.1% |
J | 8 | 8.1% |
a | 8 | 8.1% |
r | 8 | 8.1% |
p | 7 | 7.1% |
M | 6 | 6.1% |
n | 5 | 5.1% |
c | 5 | 5.1% |
e | 5 | 5.1% |
Other values (12) | 29 |
Common
Value | Count | Frequency (%) |
- | 33 | |
1 | 33 | |
9 | 11 | 11.1% |
5 | 5 | 5.1% |
0 | 4 | 4.0% |
2 | 3 | 3.0% |
8 | 3 | 3.0% |
3 | 3 | 3.0% |
7 | 2 | 2.0% |
6 | 1 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 198 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 33 | |
1 | 33 | |
9 | 11 | 5.6% |
u | 9 | 4.5% |
A | 9 | 4.5% |
J | 8 | 4.0% |
a | 8 | 4.0% |
r | 8 | 4.0% |
p | 7 | 3.5% |
M | 6 | 3.0% |
Other values (23) | 66 |
Insul_l_prcd
Real number (ℝ)
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 30.3% |
Missing | 67 |
Missing (%) | 67.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36090901 |
Minimum | 586875 |
---|---|
Maximum | 42920572 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 586875 |
---|---|
5-th percentile | 35159339 |
Q1 | 35779361 |
median | 35781503 |
Q3 | 36809748 |
95-th percentile | 41977714 |
Maximum | 42920572 |
Range | 42333697 |
Interquartile range (IQR) | 1030387 |
Descriptive statistics
Standard deviation | 6895649.4 |
---|---|
Coefficient of variation (CV) | 0.19106338 |
Kurtosis | 23.336394 |
Mean | 36090901 |
Median Absolute Deviation (MAD) | 622164 |
Skewness | -4.3923422 |
Sum | 1.1909997 × 109 |
Variance | 4.7549981 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
35781503 | 12 | 12.0% |
35159339 | 6 | 6.0% |
41349142 | 4 | 4.0% |
35779361 | 3 | 3.0% |
42920572 | 2 | 2.0% |
36809748 | 2 | 2.0% |
40755064 | 1 | 1.0% |
41348914 | 1 | 1.0% |
35779506 | 1 | 1.0% |
586875 | 1 | 1.0% |
(Missing) | 67 |
Value | Count | Frequency (%) |
586875 | 1 | 1.0% |
35159339 | 6 | |
35779361 | 3 | 3.0% |
35779506 | 1 | 1.0% |
35781503 | 12 | |
36809748 | 2 | 2.0% |
40755064 | 1 | 1.0% |
41348914 | 1 | 1.0% |
41349142 | 4 | 4.0% |
42920572 | 2 | 2.0% |
Value | Count | Frequency (%) |
42920572 | 2 | 2.0% |
41349142 | 4 | 4.0% |
41348914 | 1 | 1.0% |
40755064 | 1 | 1.0% |
36809748 | 2 | 2.0% |
35781503 | 12 | |
35779506 | 1 | 1.0% |
35779361 | 3 | 3.0% |
35159339 | 6 | |
586875 | 1 | 1.0% |
RID | SU_f_date | SU_f_prcd | SU_l_date | SU_l_prcd | SU-MET_f_date | SU-MET_f_prcd | SU-MET_l_date | SU-MET_l_prcd | Meg_f_date | Meg_f_prcd | Meg_l_date | Meg_l_prcd | Met_f_date | Met_f_prcd | Met_l_date | Met_l_prcd | TZD_f_date | TZD_f_prcd | TZD_l_date | TZD_l_prcd | DPP4i_f_date | DPP4i_f_prcd | DPP4i_l_date | DPP4i_l_prcd | DPP4i-MET_f_date | DPP4i-MET_f_prcd | DPP4i-MET_l_date | DPP4i-MET_l_prcd | Insul_f_date | Insul_f_prcd | Insul_l_date | Insul_l_prcd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | R0000001 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Apr-17 | 19106521 | Feb-18 | 19106521 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | R0000002 | Sep-09 | 19101729 | Mar-16 | 19101729 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | R0000003 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Feb-14 | 40164946 | Feb-15 | 40164946 | <NA> | <NA> | <NA> | <NA> | Feb-14 | 40239218 | Feb-15 | 40239218 | Feb-17 | 42708090 | Jun-19 | 42708090 | <NA> | <NA> | <NA> | <NA> |
3 | R0000004 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Oct-11 | 40164929 | Mar-12 | 40164929 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Oct-11 | 41348914 | Mar-12 | 35159339 |
4 | R0000005 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jun-16 | 42960773 | Mar-19 | 19079293 | Dec-17 | 40239218 | Mar-19 | 40239218 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | R0000006 | Aug-14 | 19101729 | Jun-19 | 21133671 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jul-10 | 40164929 | Aug-15 | 40164897 | <NA> | <NA> | <NA> | <NA> | Nov-10 | 19125041 | Aug-15 | 19125041 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | R0000007 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | R0000008 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jul-14 | 40164946 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jul-14 | 35779361 | <NA> | <NA> |
8 | R0000009 | Feb-12 | 19101729 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Aug-09 | 19106521 | Nov-11 | 19106521 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | R0000010 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jul-09 | 19107111 | Mar-10 | 19107111 | Jul-09 | 40164929 | May-19 | 40164946 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
RID | SU_f_date | SU_f_prcd | SU_l_date | SU_l_prcd | SU-MET_f_date | SU-MET_f_prcd | SU-MET_l_date | SU-MET_l_prcd | Meg_f_date | Meg_f_prcd | Meg_l_date | Meg_l_prcd | Met_f_date | Met_f_prcd | Met_l_date | Met_l_prcd | TZD_f_date | TZD_f_prcd | TZD_l_date | TZD_l_prcd | DPP4i_f_date | DPP4i_f_prcd | DPP4i_l_date | DPP4i_l_prcd | DPP4i-MET_f_date | DPP4i-MET_f_prcd | DPP4i-MET_l_date | DPP4i-MET_l_prcd | Insul_f_date | Insul_f_prcd | Insul_l_date | Insul_l_prcd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | R0000091 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jul-09 | 40164925 | Oct-11 | 40164946 | <NA> | <NA> | <NA> | <NA> | Jul-09 | 19125041 | Oct-11 | 19125041 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
91 | R0000092 | Sep-14 | 19101729 | Nov-14 | 19101729 | Jun-14 | 42953917 | Jun-19 | 42953740 | <NA> | <NA> | <NA> | <NA> | Jun-14 | 40164929 | Dec-16 | 40164929 | Mar-16 | 19079293 | <NA> | <NA> | Jun-14 | 40239218 | May-17 | 42960599 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
92 | R0000093 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | May-19 | 40164929 | May-19 | 40164929 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
93 | R0000094 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
94 | R0000095 | Jun-13 | 1597772 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Feb-12 | 40164922 | Aug-12 | 40164922 | <NA> | <NA> | <NA> | <NA> |
95 | R0000096 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
96 | R0000097 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Jan-15 | 40164929 | Feb-19 | 40164929 | <NA> | <NA> | <NA> | <NA> | Mar-17 | 43013911 | Feb-19 | 43013911 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
97 | R0000098 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Sep-09 | 1502829 | Feb-12 | 1502829 | Sep-09 | 40164946 | Nov-17 | 40164946 | Feb-12 | 1525221 | Aug-14 | 1525221 | Aug-14 | 40239218 | Nov-17 | 40239218 | <NA> | <NA> | <NA> | <NA> | Dec-09 | 35781503 | Dec-09 | 35781503 |
98 | R0000099 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Aug-09 | 19107111 | <NA> | <NA> | Dec-09 | 40164946 | Jul-11 | 40164946 | <NA> | <NA> | <NA> | <NA> | Nov-11 | 19125041 | Mar-19 | 43013911 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
99 | R0000100 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | Oct-12 | 40164946 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |