Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.6 KiB |
Average record size in memory | 77.3 B |
Variable types
Text | 1 |
---|---|
Unsupported | 4 |
Numeric | 4 |
Dataset
Description | 알코올 사용 장애 환자들이 시행한 혈액 검사 중에 간기능의 효과를 평가할 수 있는 주요 검사 데이터를 포함하며 검체 채취 일자와 접수 일자를 이용하여 처방시점으로 부터의 기간을 계산한 시점 데이터를 생성함. 검사항목은AST(GOT), ALT(GPT), ALP, γ-GTP 등 간기능 개선 성과와 알코올 사용을 평가할 수 있는 주요 검사항목이 포함됨 - AST(Aspartate aminotransferase. GOT(Glutamic Oxalacetic Transaminase)), ALT(alanine aminotransferase, GPT(glutamic pyruvate transaminase)) : 간세포 손상을 반영하는 아미노전이효소(Aminotransferases)로 기본적인 간기능검사 항목임 - ALP(alkaline phosphatase, 알칼리인산분해효소) : 간세포 내 담관에 존재하는 효소로 즈로 담즙 배설 장애 시 빠르게 상승함 - γ-GTP(gamma(γ)-glutamyl transferase, GGT, 감마-글루타밀전이효소) : 간세포 내 담관에 존재하는 효소로 ALP와 함께 담즙 배설 장애를 판단하는데 사용되나, 간질환 없이도 알코올 중독자, 비만한 사람의 일부, 아세트아미노펜, 페니토인, 카르바마제핀 같은 약물의 과다복용 때도 상승할 수 있음 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/main-effect-blood-test-data-alcohol-use-disorder |
AST_SRC is highly overall correlated with ALT_SRC and 2 other fields | High correlation |
ALT_SRC is highly overall correlated with AST_SRC | High correlation |
ALP_SRC is highly overall correlated with AST_SRC | High correlation |
GTP_SRC is highly overall correlated with AST_SRC | High correlation |
RID has unique values | Unique |
AST_DCT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ALT_DCT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ALP_DCT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
GTP_DCT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-10-08 18:56:19.576207 |
---|---|
Analysis finished | 2023-10-08 18:56:23.238689 |
Duration | 3.66 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000001 | 1 | 1.0% |
r0000078 | 1 | 1.0% |
r0000090 | 1 | 1.0% |
r0000089 | 1 | 1.0% |
r0000088 | 1 | 1.0% |
r0000086 | 1 | 1.0% |
r0000085 | 1 | 1.0% |
r0000084 | 1 | 1.0% |
r0000083 | 1 | 1.0% |
r0000082 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 508 | |
R | 100 | 12.5% |
1 | 47 | 5.9% |
3 | 21 | 2.6% |
6 | 20 | 2.5% |
8 | 19 | 2.4% |
9 | 19 | 2.4% |
7 | 17 | 2.1% |
2 | 17 | 2.1% |
5 | 16 | 2.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | |
Uppercase Letter | 100 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 508 | |
1 | 47 | 6.7% |
3 | 21 | 3.0% |
6 | 20 | 2.9% |
8 | 19 | 2.7% |
9 | 19 | 2.7% |
7 | 17 | 2.4% |
2 | 17 | 2.4% |
5 | 16 | 2.3% |
4 | 16 | 2.3% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 508 | |
1 | 47 | 6.7% |
3 | 21 | 3.0% |
6 | 20 | 2.9% |
8 | 19 | 2.7% |
9 | 19 | 2.7% |
7 | 17 | 2.4% |
2 | 17 | 2.4% |
5 | 16 | 2.3% |
4 | 16 | 2.3% |
Latin
Value | Count | Frequency (%) |
R | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 508 | |
R | 100 | 12.5% |
1 | 47 | 5.9% |
3 | 21 | 2.6% |
6 | 20 | 2.5% |
8 | 19 | 2.4% |
9 | 19 | 2.4% |
7 | 17 | 2.1% |
2 | 17 | 2.1% |
5 | 16 | 2.0% |
AST_DCT
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 932.0 B |
AST_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 66 |
---|---|
Distinct (%) | 66.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 81.72 |
Minimum | 15 |
---|---|
Maximum | 1085 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 19 |
Q1 | 27.75 |
median | 41.5 |
Q3 | 78 |
95-th percentile | 212.25 |
Maximum | 1085 |
Range | 1070 |
Interquartile range (IQR) | 50.25 |
Descriptive statistics
Standard deviation | 137.1877 |
---|---|
Coefficient of variation (CV) | 1.6787531 |
Kurtosis | 34.099658 |
Mean | 81.72 |
Median Absolute Deviation (MAD) | 19 |
Skewness | 5.396802 |
Sum | 8172 |
Variance | 18820.466 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
33 | 4 | 4.0% |
28 | 4 | 4.0% |
30 | 3 | 3.0% |
24 | 3 | 3.0% |
23 | 3 | 3.0% |
27 | 3 | 3.0% |
19 | 3 | 3.0% |
32 | 3 | 3.0% |
20 | 3 | 3.0% |
22 | 2 | 2.0% |
Other values (56) | 69 |
Value | Count | Frequency (%) |
15 | 1 | 1.0% |
16 | 1 | 1.0% |
17 | 1 | 1.0% |
19 | 3 | |
20 | 3 | |
21 | 2 | |
22 | 2 | |
23 | 3 | |
24 | 3 | |
25 | 1 | 1.0% |
Value | Count | Frequency (%) |
1085 | 1 | |
752 | 1 | |
380 | 1 | |
309 | 1 | |
236 | 1 | |
211 | 1 | |
207 | 1 | |
187 | 1 | |
176 | 1 | |
174 | 1 |
ALT_DCT
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 932.0 B |
ALT_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 66 |
---|---|
Distinct (%) | 66.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 62.36 |
Minimum | 8 |
---|---|
Maximum | 471 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 8 |
---|---|
5-th percentile | 13 |
Q1 | 22 |
median | 41 |
Q3 | 70.5 |
95-th percentile | 156.85 |
Maximum | 471 |
Range | 463 |
Interquartile range (IQR) | 48.5 |
Descriptive statistics
Standard deviation | 71.969764 |
---|---|
Coefficient of variation (CV) | 1.1541014 |
Kurtosis | 19.259367 |
Mean | 62.36 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 3.8996276 |
Sum | 6236 |
Variance | 5179.6469 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
22 | 6 | 6.0% |
13 | 4 | 4.0% |
40 | 4 | 4.0% |
20 | 4 | 4.0% |
25 | 4 | 4.0% |
33 | 2 | 2.0% |
119 | 2 | 2.0% |
56 | 2 | 2.0% |
111 | 2 | 2.0% |
41 | 2 | 2.0% |
Other values (56) | 68 |
Value | Count | Frequency (%) |
8 | 1 | 1.0% |
11 | 1 | 1.0% |
13 | 4 | |
14 | 2 | |
15 | 1 | 1.0% |
16 | 1 | 1.0% |
17 | 2 | |
18 | 1 | 1.0% |
19 | 2 | |
20 | 4 |
Value | Count | Frequency (%) |
471 | 1 | |
467 | 1 | |
200 | 1 | |
193 | 1 | |
173 | 1 | |
156 | 1 | |
142 | 1 | |
137 | 1 | |
129 | 1 | |
126 | 1 |
ALP_DCT
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 932.0 B |
ALP_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 68 |
---|---|
Distinct (%) | 68.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 85.97 |
Minimum | 28 |
---|---|
Maximum | 263 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 28 |
---|---|
5-th percentile | 38.85 |
Q1 | 57 |
median | 75 |
Q3 | 95.25 |
95-th percentile | 195.5 |
Maximum | 263 |
Range | 235 |
Interquartile range (IQR) | 38.25 |
Descriptive statistics
Standard deviation | 45.475591 |
---|---|
Coefficient of variation (CV) | 0.52897047 |
Kurtosis | 3.4461141 |
Mean | 85.97 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 1.8030196 |
Sum | 8597 |
Variance | 2068.0294 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
59 | 4 | 4.0% |
75 | 4 | 4.0% |
57 | 3 | 3.0% |
64 | 3 | 3.0% |
50 | 3 | 3.0% |
94 | 3 | 3.0% |
55 | 3 | 3.0% |
54 | 2 | 2.0% |
112 | 2 | 2.0% |
87 | 2 | 2.0% |
Other values (58) | 71 |
Value | Count | Frequency (%) |
28 | 1 | |
32 | 1 | |
33 | 1 | |
34 | 1 | |
36 | 1 | |
39 | 1 | |
41 | 1 | |
43 | 1 | |
46 | 2 | |
48 | 2 |
Value | Count | Frequency (%) |
263 | 1 | |
232 | 1 | |
216 | 1 | |
211 | 1 | |
205 | 1 | |
195 | 1 | |
187 | 1 | |
165 | 1 | |
162 | 1 | |
145 | 1 |
GTP_DCT
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 932.0 B |
GTP_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 89 |
---|---|
Distinct (%) | 89.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 280.804 |
Minimum | 14 |
---|---|
Maximum | 1928 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 14 |
---|---|
5-th percentile | 24 |
Q1 | 69.75 |
median | 140.5 |
Q3 | 328.25 |
95-th percentile | 889 |
Maximum | 1928 |
Range | 1914 |
Interquartile range (IQR) | 258.5 |
Descriptive statistics
Standard deviation | 320.02431 |
---|---|
Coefficient of variation (CV) | 1.1396715 |
Kurtosis | 6.674795 |
Mean | 280.804 |
Median Absolute Deviation (MAD) | 99.85 |
Skewness | 2.2227236 |
Sum | 28080.4 |
Variance | 102415.56 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
108.0 | 3 | 3.0% |
27.0 | 2 | 2.0% |
103.0 | 2 | 2.0% |
24.0 | 2 | 2.0% |
95.0 | 2 | 2.0% |
307.0 | 2 | 2.0% |
97.0 | 2 | 2.0% |
54.0 | 2 | 2.0% |
82.0 | 2 | 2.0% |
87.0 | 2 | 2.0% |
Other values (79) | 79 |
Value | Count | Frequency (%) |
14.0 | 1 | |
18.0 | 1 | |
19.0 | 1 | |
20.0 | 1 | |
24.0 | 2 | |
26.0 | 1 | |
27.0 | 2 | |
35.0 | 1 | |
39.0 | 1 | |
40.0 | 1 |
Value | Count | Frequency (%) |
1928.0 | 1 | |
1104.0 | 1 | |
1088.0 | 1 | |
1083.0 | 1 | |
965.0 | 1 | |
885.0 | 1 | |
811.0 | 1 | |
792.0 | 1 | |
786.0 | 1 | |
732.0 | 1 |
RID | AST_SRC | ALT_SRC | ALP_SRC | GTP_SRC | |
---|---|---|---|---|---|
RID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
AST_SRC | 1.000 | 1.000 | 0.903 | 0.521 | 0.543 |
ALT_SRC | 1.000 | 0.903 | 1.000 | 0.000 | 0.488 |
ALP_SRC | 1.000 | 0.521 | 0.000 | 1.000 | 0.696 |
GTP_SRC | 1.000 | 0.543 | 0.488 | 0.696 | 1.000 |
AST_SRC | ALT_SRC | ALP_SRC | GTP_SRC | |
---|---|---|---|---|
AST_SRC | 1.000 | 0.611 | 0.584 | 0.719 |
ALT_SRC | 0.611 | 1.000 | 0.282 | 0.441 |
ALP_SRC | 0.584 | 0.282 | 1.000 | 0.493 |
GTP_SRC | 0.719 | 0.441 | 0.493 | 1.000 |
RID | AST_DCT | AST_SRC | ALT_DCT | ALT_SRC | ALP_DCT | ALP_SRC | GTP_DCT | GTP_SRC | |
---|---|---|---|---|---|---|---|---|---|
0 | R0000001 | 2011-05-15 00:00:00 | 39 | 2011-05-15 00:00:00 | 41 | 2011-05-15 00:00:00 | 57 | 2011-05-15 00:00:00 | 27.0 |
1 | R0000002 | 2011-09-23 00:00:00 | 22 | 2011-09-23 00:00:00 | 22 | 2011-09-23 00:00:00 | 39 | 2011-09-23 00:00:00 | 41.0 |
2 | R0000003 | 2016-01-05 00:00:00 | 138 | 2016-01-05 00:00:00 | 61 | 2016-01-05 00:00:00 | 211 | 2016-01-05 00:00:00 | 594.0 |
3 | R0000005 | 2017-10-24 00:00:00 | 75 | 2017-10-24 00:00:00 | 70 | 2017-10-24 00:00:00 | 65 | 2017-10-24 00:00:00 | 246.0 |
4 | R0000007 | 2010-06-04 00:00:00 | 19 | 2010-06-04 00:00:00 | 20 | 2010-06-04 00:00:00 | 55 | 2010-06-04 00:00:00 | 18.0 |
5 | R0000008 | 2012-11-29 00:00:00 | 19 | 2012-11-29 00:00:00 | 11 | 2012-11-29 00:00:00 | 59 | 2012-11-29 00:00:00 | 24.0 |
6 | R0000009 | 2012-12-27 00:00:00 | 43 | 2012-12-27 00:00:00 | 117 | 2012-12-27 00:00:00 | 54 | 2012-12-27 00:00:00 | 118.0 |
7 | R0000011 | 2009-05-27 00:00:00 | 71 | 2009-05-27 00:00:00 | 173 | 2009-05-27 00:00:00 | 79 | 2009-05-27 00:00:00 | 307.0 |
8 | R0000012 | 2016-06-27 00:00:00 | 47 | 2016-06-27 00:00:00 | 29 | 2016-06-27 00:00:00 | 124 | 2016-06-27 00:00:00 | 236.0 |
9 | R0000013 | 2016-12-14 00:00:00 | 21 | 2016-12-14 00:00:00 | 34 | 2016-12-14 00:00:00 | 55 | 2016-12-14 00:00:00 | 97.0 |
RID | AST_DCT | AST_SRC | ALT_DCT | ALT_SRC | ALP_DCT | ALP_SRC | GTP_DCT | GTP_SRC | |
---|---|---|---|---|---|---|---|---|---|
90 | R0000109 | 2012-09-08 00:00:00 | 1085 | 2012-09-08 00:00:00 | 467 | 2012-09-08 00:00:00 | 88 | 2012-09-08 00:00:00 | 467.0 |
91 | R0000110 | 2016-11-15 00:00:00 | 26 | 2016-11-15 00:00:00 | 56 | 2016-11-15 00:00:00 | 82 | 2016-11-15 00:00:00 | 259.0 |
92 | R0000111 | 2014-11-26 00:00:00 | 48 | 2014-11-26 00:00:00 | 13 | 2014-11-26 00:00:00 | 96 | 2014-11-26 00:00:00 | 432.0 |
93 | R0000112 | 2013-12-11 00:00:00 | 187 | 2013-12-11 00:00:00 | 110 | 2013-12-11 00:00:00 | 232 | 2013-12-11 00:00:00 | 207.0 |
94 | R0000113 | 2018-01-20 00:00:00 | 102 | 2018-01-20 00:00:00 | 49 | 2018-01-20 00:00:00 | 117 | 2018-01-20 00:00:00 | 1104.0 |
95 | R0000114 | 2010-08-19 00:00:00 | 92 | 2010-08-19 00:00:00 | 126 | 2010-08-19 00:00:00 | 104 | 2010-08-19 00:00:00 | 732.0 |
96 | R0000115 | 2012-10-17 00:00:00 | 43 | 2012-10-17 00:00:00 | 20 | 2012-10-17 00:00:00 | 48 | 2012-10-17 00:00:00 | 66.0 |
97 | R0000116 | 2016-03-29 00:00:00 | 236 | 2016-03-29 00:00:00 | 193 | 2016-03-29 00:00:00 | 78 | 2016-03-29 00:00:00 | 1088.0 |
98 | R0000118 | 2015-10-27 00:00:00 | 78 | 2015-10-27 00:00:00 | 39 | 2015-10-27 00:00:00 | 145 | 2015-10-27 00:00:00 | 811.0 |
99 | R0000119 | 2018-06-04 | 27 | 2018-06-04 | 43 | 2018-06-04 | 49 | 2018-06-04 | 108.0 |